Intelligently summarizing and presenting textual responses with machine learning

ABSTRACT

This disclosure relates to methods, non-transitory computer readable media, and systems apply machine-learning techniques and computational sentiment analysis to summarize sentences extracted from a group of textual responses or to select representative-textual responses from the group of textual responses. By using a response-extraction-neural network to extract (and sometimes paraphrase) sentences from textual responses, the disclosed methods, non-transitory computer readable media, and systems can generate a response summary of textual responses based on sentiment indicators corresponding to the textual responses. By applying a machine-learning classifier to generate textual quality scores for textual responses, the disclosed methods, non-transitory computer readable media, and systems select representative-textual responses from a group of textual responses based on relevancy parameters and sentiment indicators corresponding to the textual responses. Such computational techniques generate response summaries and representative responses that provide an efficient snapshot of a group of textual responses analyzed by machine learners.

BACKGROUND

Comment-review systems currently display categorized or popular textualcomments on various software applications and websites for the purchaseor review of products and services. For example, comment-review systemsoften surface popular reviews of places and products oncrowd-source-review forums or electronic-commerce sites on the Internet.To display such categorized or popular textual comments, somecomment-review systems select comments by users based on (i) verifyingthe user purchased or used a reviewed item, (ii) views of (or votes for)comments posted via an application or website, or (iii) keyword searchesor frequently answered questions from users. Despite facilitating accessto comments on such applications and websites, existing technicalapplications limit conventional comment-review systems to surfacingcomments unrepresentative of a larger body of comments, hiding other ormore common comments behind layers of user interfaces, or retrievingcomments that match narrow or rigid filters. The following paragraphsdescribe some of these technical deficiencies.

By relying on verification or voting to select textual comments, someconventional comment-review systems select textual comments that areoutliers in comparison to other comments received by an application orsite. While outlier comments can be helpful, a comment-review system mayapply an algorithm that inadvertently selects them for display becausethe comments are humorous, entertaining, or unusually detailed. Suchcomments can mislead users when analyzing or reviewing a body ofcomments. By selecting for comments based on voting or views,conventional comment-review systems can also surface textual commentsthat capture isolated user experiences unrepresentative of the body ofcomments, such as reviews describing unusually good or bad experienceswith a product—including one-of-a-kind manufacturing defects or serviceexperiences.

In addition to displaying outlier comments, some existing comment-reviewsystems include filtering or search functions that return textualcomments defined by narrow search algorithms or comments hand selectedby humans. For instance, comment-review systems sometimes includekeyword-search functions that identify textual comments matchingkeywords entered by users or synonyms of such keywords. As a furtherexample, comment-review systems can display comments selected or enteredby users in response to frequently asked questions (“FAQ s”). Butdisplaying outlier comments, keyword-search functionalities, anduser-selected answers can bury or obscure more common textual commentsfrom users. Conventional comment-review systems often provide usersaccess to more common comments only with multiple selectable options anduser interfaces through which a user must navigate.

Because of the technical limitations of conventional comment-reviewsystems, some firms or organizations rely on human reviewers to analyzetextual comments. But human review of such comments can be laborintensive, expensive, and produce inaccurate or inconsistent analyses.For example, human review of thousands, or tens or hundreds ofthousands, or even millions of textual comments may need to be splitbetween multiple groups of reviewers and subject such review todifferent human approaches or inconsistent viewpoints. In many cases,the volume of textual comments is practically impossible for humans toanalyze. Indeed, humans often cannot accurately or consistently analyzeor process such volumes of textual comments. Nor do existingconventional-comment review systems provide tools for consistentanalysis of a common body of comments or review free from the technicallimitations described above.

SUMMARY

This disclosure describes one or more embodiments of methods,non-transitory computer readable media, and systems that solve theforegoing problems in addition to providing other benefits. For example,in some embodiments, the disclosed systems apply machine-learningtechniques to summarize sentences extracted from a group of textualresponses or to select representative-textual responses from the groupof textual responses. By using a response-extraction-neural network toextract sentences from textual responses, the disclosed systems cangenerate a response summary of textual responses based on sentimentindicators corresponding to the textual responses. By applying amachine-learning classifier to generate textual quality scores fortextual responses, the disclosed systems select representative-textualresponses from a group of textual responses based on relevancyparameters and sentiment indicators corresponding to the textualresponses. Such computational techniques generate response summaries andrepresentative responses that provide efficient snapshots of a group oftextual responses analyzed by machine learners.

In some embodiments, for instance, the disclosed systems provide textualresponses to a response-extraction-neural network to generate extractedsentences that correspond to selected sentences from the textualresponses. The disclosed systems further determine a sentiment indicatorfor each extracted sentence to sort the extracted sentences. Based on asorted set of the extracted sentences, the disclosed systems generate afirst cluster of extracted sentences corresponding to a first contextand a second cluster of extracted sentences corresponding to a secondcontext. Based on a first extracted sentence from the first cluster anda second extracted sentence from the second cluster, the disclosedsystems generate a response summary for the textual responses.

Additionally, in certain implementations, the disclosed systems providetextual responses to a text-quality classifier to generate a textualquality score for each textual response. The disclosed systems furtherdetermine a relevancy parameter, a sentiment indicator, and a topic foreach textual response. In each such determination, the relevancyparameter indicates a relevance of a given textual response to a userquery, the sentiment indicator indicates a linguistic sentiment of thegiven textual response, and the topic indicates a subject matter of thegiven textual response. Based on the sentiment indicator and the topicfor each textual response, the disclosed systems generate a firstresponse group of textual responses and a second response group oftextual responses. Based on the relevancy parameter for each textualresponse within the first and second response groups, the disclosedsystems further select representative-textual responses for display on aclient device—including a first representative-textual response from thefirst response group and a second representative-textual response fromthe second response group.

BRIEF DESCRIPTION OF THE DRAWINGS

The detailed description refers to the drawings briefly described below.

FIG. 1 illustrates a block diagram of an environment in which anintelligent-text-insight system can operate in accordance with one ormore embodiments.

FIG. 2 illustrates a conceptual diagram of an intelligent-text-insightsystem applying machine-learning techniques and computational sentimentanalysis to generate a response summary and to selectrepresentative-textual responses from a set of textual responses inaccordance with one or more embodiments.

FIG. 3A illustrates an intelligent-text-insight system training anextractor-selection network to select training sentences fromtextual-training responses based on similar-response sentences andground-truth-paraphrased sentences in accordance with one or moreembodiments.

FIG. 3B illustrates an intelligent-text-insight system training anabstractor network to paraphrase selected-training sentences based onground-truth-paraphrased sentences in accordance with one or moreembodiments.

FIG. 3C illustrates an intelligent-text-insight system training anextractor-selection network and an abstractor network to paraphraseselected-training sentences based on ground-truth-paraphrased sentencesin accordance with one or more embodiments.

FIG. 3D illustrates an intelligent-text-insight system applying aresponse-extraction-neural network and computational sentiment analysisto generate a response summary for a set of textual responses inaccordance with one or more embodiments.

FIGS. 4A-4B illustrate a computing device presenting response summariesin graphical user interfaces in accordance with one or more embodiments.

FIG. 5A illustrates an intelligent-text-insight system training atext-quality classifier to generate textual-quality-training scores fortextual-training responses based on ground-truth-quality scores inaccordance with one or more embodiments.

FIG. 5B an intelligent-text-insight system applying a textual-qualityclassifier and computational sentiment analysis to selectrepresentative-textual responses from a set of textual responses inaccordance with one or more embodiments.

FIGS. 6A-6B illustrate a computing device presentingrepresentative-textual responses in graphical user interfaces inaccordance with one or more embodiments.

FIGS. 7A-7B illustrate sequence-flow diagrams of anintelligent-text-insight system using different architectures to selectrepresentative-textual responses from a set of textual responses inresponse to a query in accordance with one or more embodiments.

FIG. 8 illustrates a flowchart of a series of acts for applying aresponse-extraction-neural network and computational sentiment analysisto generate a response summary for a set of textual responses inaccordance with one or more embodiments.

FIG. 9 illustrates a flowchart of a series of acts for applying atextual-quality classifier and computational sentiment analysis toselect representative-textual responses from a set of textual responsesin accordance with one or more embodiments.

FIG. 10 illustrates a block diagram of a computing device in accordancewith one or more embodiments.

FIG. 11 illustrates a network environment of a digital survey system inaccordance with one or more embodiments.

DETAILED DESCRIPTION

This disclosure describes embodiments of an intelligent-text-insightsystem that applies machine-learning techniques and computationalsentiment analysis to summarize sentences extracted from a group oftextual responses or to select representative-textual responses from thegroup of textual responses. The intelligent-text-insight system cangenerate a response summary of textual responses by using (i) aresponse-extraction-neural network to extract (and sometimes paraphrase)sentences from textual responses and (ii) sentiment indicatorscorresponding to the textual responses. The intelligent-text-insightsystem can also select representative-textual responses from a group oftextual responses by using (i) a machine-learning classifier to generatetextual quality scores for textual responses and (ii) relevancyparameters and sentiment indicators corresponding to the textualresponses. Such computational techniques generate response summaries andrepresentative responses that provide machine-learning-customizedsnapshots of a group of textual responses.

As mentioned above, in some embodiments, the intelligent-text-insightsystem provides textual responses to a response-extraction-neuralnetwork to generate extracted sentences that correspond to selectedsentences from the textual responses. The intelligent-text-insightsystem further determines a sentiment indicator for each extractedsentence to sort the extracted sentences. Based on a sorted set of theextracted sentences, the intelligent-text-insight system generates afirst cluster of extracted sentences corresponding to a first contextand a second cluster of extracted sentences corresponding to a secondcontext. Based on a first extracted sentence from the first cluster anda second extracted sentence from the second cluster, theintelligent-text-insight system subsequently generates a responsesummary for the textual responses. As described below, the first andsecond clusters of extracted sentences are merely examples. Theintelligent-text-insight system can extract and generate such a responsesummary based on any number of extracted-sentence clusters correspondingto any number of contexts.

To provide textual responses to the response-extraction-neural network,in certain implementations, the intelligent-text-insight system selectsa set of textual responses based on a user query or computer-generatedquery. For instance, the intelligent-text-insight system can receive auser query from a client device searching for textual responses thatcorrespond to a time period or topic and that respond to a surveyquestion. The intelligent-text-insight system subsequently selects a setof textual responses to feed the response-extraction-neural networkbased on (i) time identifiers for textual responses that correspond tothe queried time period or (ii) relevance of textual responses to thequeried topic. In response to a query by time period or topic, theintelligent-text-insight system can provide a response summary for theset of textual responses to the client device for display within agraphical user interface.

In addition to extracting sentences from textual responses, in someembodiments, the response-extraction-neural network further paraphrasesthe extracted sentences. To paraphrase such sentences, theintelligent-text-insight system optionally trains both anextractor-selection network of the response-extraction-neural network toextract selected sentences and an abstractor network of theresponse-extraction-neural network to paraphrase selected sentences.Having trained the response-extraction-neural network, theintelligent-text-insight system can extract selected sentences from aset of textual responses using the extractor-selection network of theresponse-extraction-neural network. Upon extraction, theintelligent-text-insight system can paraphrase one or more of theselected sentences using the abstractor network of theresponse-extraction-neural network.

As suggested above, the intelligent-text-insight system can applycomputational sentiment analysis to extracted sentences. For instance,having extracted sentences from textual responses, in certainimplementations, the intelligent-text-insight system determinessentiment indicators by determining a sentiment score indicating apositive sentiment or a negative sentiment for each extracted sentence.Based on such sentiment scores, the intelligent-text-insight systemoptionally generates clusters of extracted sentences. For instance, insome cases, the intelligent-text-insight system generates clusters ofextracted sentences that correspond to positive or negative sentimentscores and to a first or second linguistic context embedding.

After generating clusters of extracted sentences, in some embodiments,the intelligent-text-insight system selects extracted sentences fromsuch clusters based on a relevance of each extracted sentence to acontext of a cluster (e.g., a linguistic context embedding). Forinstance, in certain implementations, the intelligent-text-insightsystem selects first and second extracted sentences respectively fromfirst and second clusters based on the maximum marginal relevance of theextracted sentences to the first and second linguistic contextembeddings for their respective clusters. In some such cases, theintelligent-text-insight system selects multiple extracted sentencesfrom each cluster of extracted sentences.

Based on selected and extracted sentences, the intelligent-text-insightsystem subsequently generates a response summary. For example, in someembodiments, the intelligent-text-insight system generates a responsesummary for a set of textual responses comprising both a first extractedsentence (from a first cluster) and a second extracted sentence (from asecond cluster). By contrast, in certain implementations, theintelligent-text-insight system uses a summarization neural network togenerate compressed sentences representing summaries of multipleextracted sentences from each cluster and composes a response summarycomprising the compressed sentences.

In addition (or in the alternative) to generating a response summary fortextual responses, the intelligent-text-insight system can usemachine-learning techniques to select representative-textual responsesfrom a set of textual responses. For instance, in some embodiments, theintelligent-text-insight system provides textual responses to atext-quality classifier to generate a textual quality score for eachtextual response. The intelligent-text-insight system further determinesa relevancy parameter, a sentiment indicator, and a topic for eachtextual response. In each such determination, the relevancy parameterindicates a relevance of a given textual response to a user query, thesentiment indicator indicates a linguistic sentiment of the giventextual response, and the topic indicates a subject matter of the giventextual response. Based on the sentiment indicator and the topic foreach textual response, the intelligent-text-insight system generates afirst response group of textual responses and a second response group oftextual responses. Based on the relevancy parameter for each textualresponse within the first and second response groups, theintelligent-text-insight system further selects representative-textualresponses for display on a client device—including a firstrepresentative-textual response from the first response group and asecond representative-textual response from the second response group.As described below, the first and second response groups andrepresentative-textual responses are merely examples. Theintelligent-text-insight system can generate any number of responsegroups of textual responses and select any number ofrepresentative-textual responses.

To provide textual responses to the text-quality classifier, in certainimplementations, the intelligent-text-insight system selects a set oftextual responses based on a user query or computer-generated query. Forinstance, the intelligent-text-insight system can select a set oftextual responses to feed the text-quality classifier based on (i) timeidentifiers for textual responses that correspond to a queried timeperiod or (ii) textual responses relevant to a queried topic. Inresponse to a query by time period or topic, theintelligent-text-insight system can provide representative-textualresponses from a set of textual responses to the client device fordisplay within a graphical user interface.

Having selected a set of textual responses, the intelligent-text-insightsystem feeds the set of textual responses to the text-quality classifierto generate textual quality scores. To generate such scores, theintelligent-text-insight system optionally trains a text-qualityclassifier—such as a decision-tree classifier or a neural-networkclassifier—to generate textual quality scores for textual-trainingresponses based on ground-truth-quality scores. Having trained theclassifier, the intelligent-text-insight system uses the text-qualityclassifier to generate a textual quality score for each textual responsefrom a set of textual responses based on textual feature dimensions foreach textual response.

In some embodiments, the intelligent-text-insight system uses textualquality scores to sort a set of textual responses. For instance, theintelligent-text-insight system can select textual responses satisfyinga textual-quality-score threshold and sort the selected textualresponses based on a relevancy parameter for each selected textualresponse. The intelligent-text-insight system optionally generates sucha set of sorted textual responses as a precursor to creating responsegroups of textual responses.

In addition to sorting textual responses, the intelligent-text-insightsystem can determine one or both of sentiment indicators andsentiment-polarity indicators for each textual response as a basis forcreating response groups. For instance, in some embodiments, theintelligent-text-insight system determines a sentiment score indicatinga positive sentiment or a negative sentiment for each extractedsentence. The intelligent-text-insight system can further determine asentiment-polarity indicator based on the sentiment score for eachtextual response satisfying (or not satisfying) apositive-sentiment-score threshold or a negative-sentiment-scorethreshold.

In certain implementations, the intelligent-text-insight systemgenerates response groups of textual responses based on the sentimentscore, the sentiment-polarity indicator, and the topic for each textualresponse. Such response groups may include, for instance, a firstresponse group of textual responses corresponding to a first range ofsentiment scores and a second response group of textual responsescorresponding to a second range of sentiment scores. Accordingly,response groups may correspond to positive or negative sentiment scoreswithin a defined range.

As noted above, in certain implementations, the intelligent-text-insightsystem selects representative-textual responses from response groups oftextual responses for display within a graphical user interface. Forinstance, in some embodiments, the intelligent-text-insight systemselects first and second representative-textual responses respectivelyfrom first and second response groups based on a textual quality scoreand relevancy parameter for each textual response within theirrespective response groups.

As suggested above, the intelligent-text-insight system overcomesseveral technical deficiencies that hinder conventional comment-reviewsystems. First, the intelligent-text-insight system introduces newanalysis functions and graphical user interfaces that efficientlypresent machine-learning-customized snapshots of textual responses. Byusing a response-extraction-neural network to extract (and sometimesparaphrase) sentences from textual responses—and by relying oncomputational sentiment analysis—the intelligent-text-insight systemgenerates a response summary that captures common sentiments expressedin a set of textual responses. Such a response summary avoidsconventional application of the algorithmic filters for purchaseverification, views, or user votes that can obscure common sentiments inconventional comment-review systems. Rather than burying recurrent ideasfound in a set of textual responses, the intelligent-text-insight systememploys an ordered algorithm of machine learning and sentiment analysisto surface common concepts from textual responses in a concise summary.

In addition to a response summary, in some embodiments, theintelligent-text-insight system generates representative-textualresponses from a group of textual responses by employingmachine-learning classification and sentiment indicators correspondingto a set of textual responses. In contrast to the outlier comments fromconventional comment-review systems, the intelligent-text-insight systemuses machine learning to select representative-textual responses thatcapture common complaints, common praise, and other common conceptsexpressed in a subset of textual responses. By applying amachine-learning classifier and computational sentiment analysis, incertain implementations, the intelligent-text-insight system can formresponse groups corresponding to different positive and negativesentiments and display verbatim representative-textual responses fromsuch response groups.

Second, the intelligent-text-insight system improves the efficiency andflexibility with which a comment-review system displays textualresponses. As noted above, some conventional comment-review systemslimit search or display of textual comments to keyword searches oranswers to frequently answered questions. Rather than limiting textualresponses by keyword search or FAQs, in some embodiments, theintelligent-text-insight system introduces an efficient and flexibleanalysis of textual responses to surface concise representations of suchresponses by time period or topic. In response to a query by time periodor topic, for instance, the intelligent-text-insight system can applymachine-learning techniques to return one or both of a response summaryand representative-textual responses for an entire body of textualresponses—according to the queried time period or queried topic. In someembodiments, the intelligent-text-insight system can also generate asummary comparison of different response summaries corresponding todifferent time periods to indicate changes over time in textualresponses (e.g., in response to a particular survey question).

In addition to efficient and flexible queries by time period or topic,the intelligent-text-insight system further streamlines or simplifiesuser experience with consolidated graphical user interfaces. In contrastto some conventional comment-review systems that funnel users throughmultiple selectable options and user interfaces, theintelligent-text-insight system provides graphical user interfacescomprising response summaries or representative-textual responses thatconsolidate or surface frequently expressed issues from textualresponses. Such consolidated summaries and representative-textualresponses eliminate excess navigation through user interfaces.

Third, the intelligent-text-insight system can automate a process ofsummarizing textual responses or surfacing representative-textualresponses that computing systems previously could not perform. Assuggested above, conventional comment-review systems lack the technicalcapability of automatically summarizing textual responses or surfacingrepresentative-textual responses. In the absence of such technicalcapabilities, human analysts often review textual responses in what canbe a labor-intensive and expensive process that often producesinaccurate or inconsistent summaries or representative responses. Insome cases, the volume of textual responses is practically impossiblefor humans to accurately or consistently analyze. Theintelligent-text-insight system obviates such human review by employingan algorithm involving machine learners and computational sentimentanalysis to capture common concepts from textual responses in a concisesummary or in representative-textual responses selected for display to auser.

Turning now to the figures, FIG. 1 illustrates a block diagram of asystem environment (“environment”) 100 in which a digital survey system104 and an intelligent-text-insight system 106 operate in accordancewith one or more embodiments. As illustrated in FIG. 1, the environment100 includes server device(s) 102, an administrator device 108, andrecipient devices 114 a-114 n, where the server device(s) 102 includethe digital survey system 104. As shown in FIG. 1, the digital surveysystem 104 comprises the intelligent-text-insight system 106. Each ofthe administrator device 108 and the recipient devices 114 a-114 n areassociated with a type of user. An administrator 112 is associated withthe administrator device 108 and uses the administrator device 108 tomanage the creation and distribution of a digital survey or review oftextual responses. Recipients 108 a-108 n are respectively associatedwith the recipient devices 114 a-114 n and use the recipient devices 114a-114 n to provide textual responses (e.g., to a digital surveyquestion).

In some embodiments, the administrator device 108 and the recipientdevices 114 a-114 n communicate with server device(s) 102 over a network120. As described below, the server device(s) 102 can enable the variousfunctions, features, processes, methods, and systems described hereinusing, for example, the intelligent-text-insight system 106. As shown inFIG. 1, the intelligent-text-insight system 106 comprises computerexecutable instructions that, when executed by a processor of the serverdevice(s) 102, perform certain actions described below with reference toFIGS. 2-9. Additionally, or alternatively, in some embodiments, theserver device(s) 102 coordinate with one or both of the administratordevice 108 and the recipient devices 114 a-114 n to perform or providethe various functions, features, processes, methods, and systemsdescribed in more detail below. Although FIG. 1 illustrates a particulararrangement of the server device(s) 102, the administrator device 108,the recipient devices 114 a-114 n, and the network 120, variousadditional arrangements are possible. For example, the server device(s)102 and the digital survey system 104 may directly communicate with theadministrator device 108, bypassing the network 120.

Generally, the administrator device 108 and recipient devices 114 a-114n may be any one of various types of client devices. For example, theadministrator device 108 and recipient devices 114 a-114 n may be mobiledevices (e.g., a smart phone, tablet), laptops, desktops, or any othertype of computing devices, such as those described below with referenceto FIG. 10. Additionally, the server device(s) 102 may include one ormore computing devices, including those explained below with referenceto FIG. 10. The server device(s) 102, the administrator device 108, andthe recipient devices 114 a-114 n may communicate using anycommunication platforms and technologies suitable for transporting dataand/or communication signals, including the examples described belowwith reference to FIG. 11.

To access the functionalities of the intelligent-text-insight system106, in certain embodiments, the administrator 112 interacts with anadministrator application 110 on the administrator device 108.Similarly, to access digital surveys, compose textual responses, orother functions of the digital survey system 104, in someimplementations, the recipients 108 a-108 n interact with digitalresponse applications 116 a-116 n, respectively. In some embodiments,one or both of the administrator application 110 and the digitalresponse applications 116 a-116 n comprise web browsers, applets, orother software applications (e.g., native applications or webapplications) available to the administrator device 108 or the recipientdevices 114 a-114 n, respectively. Additionally, in some instances, thedigital survey system 104 provides data packets including instructionsthat, when executed by the administrator device 108 or the recipientdevices 114 a-114 n, create or otherwise integrate the administratorapplication 110 or the digital response applications 116 a-116 n withinan application or webpage for the administrator device 108 or therecipient devices 114 a-114 n, respectively.

As an initial overview, the server device(s) 102 provide theadministrator device 108 access to the digital survey system 104 and theintelligent-text-insight system 106 by way of the network 120. In one ormore embodiments, by accessing the digital survey system 104, the serverdevice(s) 102 provide one or more digital documents to the administratordevice 108 to enable the administrator 112 to compose a digital survey.For example, the digital survey system 104 can include a website (e.g.,one or more webpages) or utilize the administrator application 110 toenable the administrator 112 to create a digital survey or other digitalcontent for distribution to the recipient devices 114 a-114 n.

In some cases, the administrator device 108 launches the administratorapplication 110 to facilitate interacting with the digital survey system104 or its constituent intelligent-text-insight system 106. Theadministrator application 110 may coordinate communications between theadministrator device 108 and the server device(s) 102 that ultimatelyresult in the creation of a digital survey or other digital content thatthe digital survey system 104 distributes to one or more of therecipient devices 114 a-114 n. For instance, to facilitate the creationof a digital survey, the administrator application 110 can providegraphical user interfaces of the digital survey system 104, receiveindications of interactions from the administrator 112 with theadministrator device 108, and cause the administrator device 108 tocommunicate user input based on the detected interactions to the digitalsurvey system 104, such as communicating a textual response.

As noted above, the intelligent-text-insight system 106 can applymachine-learning techniques and computational sentiment analysis tosummarize sentences extracted from a group of textual responses or toselect representative-textual responses from the group of textualresponses. FIG. 2 provides a brief example of one such embodiment of theintelligent-text-insight system 106. In particular, FIG. 2 illustratesthe intelligent-text-insight system 106 applying aresponse-extraction-neural network 204 and computational sentimentanalysis to generate a response summary 210 for a set of textualresponses. FIG. 2 further illustrates the intelligent-text-insightsystem 106 applying a text-quality classifier 208 and computationalsentiment analysis to select representative-textual responses 212 from aset of textual responses.

As shown in FIG. 2, the digital survey system 104 receives textualresponses 202 a-202 n from recipient devices. As used in thisdisclosure, the term “textual response” refers to a text that forms oneor more statements or questions. In particular, a textual responseincludes one or more sentences of human-readable characters respondingto a survey question, digital posting, digital message, or other digitalcontent. For example, a textual response can include sentences composedby a respondent in response to a survey question distributed by thedigital survey system 104. As a further example, a textual response caninclude sentences composed by a respondent in an email, a digital oronline posting, a text message, or other digital message responding toanother email, digital or online posting, text message, or other digitalmessage.

After receiving the textual responses 202 a-202 n, theintelligent-text-insight system 106 selects (from the textual responses202 a-202 n) a set of textual responses to provide to aresponse-extraction-neural network 204. For instance, theintelligent-text-insight system 106 may select textual responsesresponding to a survey question based on one or both of a relevance to aquery and recency of textual response.

As used in this disclosure, the term “response-extraction-neuralnetwork” refers to an artificial neural network that selects sentencesfrom textual responses based on characteristics of each textualresponse. In particular, in certain implementations, aresponse-extraction-neural network refers to an artificial neuralnetwork that extracts selected sentences from textual responses andparaphrases the extracted sentences. A response-extraction-neuralnetwork can comprise one or more encoders and one or more decoders, suchas an encoder-decoder-neural network or encoders and decoders part of(or used in conjunction with) a convolutional neural network (“CNN”) orrecurrent neural network (“RNN”). As explained further below, in somecases, a response-extraction-neural network includes both anextractor-selection network for extracting selected sentences and anabstractor network for paraphrasing the selected sentences.

Relatedly, the term “extracted sentence” refers to a sentence extractedfrom (or identified within) a textual response. In particular, in someembodiments, an extracted sentence refers to a complete or incompletesentence selected by a response-extraction-neural network from a textualresponse. In some such cases, the response-extraction-neural networkselects sentences from textual responses based on relevance to a query.For example, the intelligent-text-insight system 106 may use theresponse-extraction-neural network 204 to generate a set of extractedsentences from a set of textual responses, where each extracted sentencecorresponds to a sentence selected by the response-extraction-neuralnetwork 204 from the textual responses 202 a-202 n.

Upon generating a set of extracted sentences, theintelligent-text-insight system 106 inputs each extracted sentence fromthe set into a text-analysis engine 206 to determine a sentimentindicator for each extracted sentence. As used in this disclosure, theterm “sentiment indicator” refers to a field, label, score, or otherindication of a linguistic sentiment. In particular, in someembodiments, a sentiment indicator refers to a sentiment label orsentiment score determined by computational sentiment analysis for atextual response or a sentence. Such computational sentiment analysiscan include lexicon-based sentiment analysis (also known asknowledge-based sentiment analysis) or statistical-based sentimentanalysis.

When the text-analysis engine 206 analyzes an extracted sentence, forexample, the text-analysis engine 206 can determine a sentiment label ofpositive, negative, neutral, or mixed for each extracted sentence.Additionally, or alternatively, the text-analysis engine 206 candetermine a sentiment score for each extracted sentence (e.g., asentiment score between −10 and 10). In some embodiments, such asentiment score indicates a positive or negative sentiment of theextracted sentence. Based on the sentiment indicators, theintelligent-text-insight system 106 can sort the set of extractedsentences (e.g., accordingly to polarity of sentiment scores or rangesof sentiment scores).

Based on a sorted set of extracted sentences, theintelligent-text-insight system 106 generates clusters of extractedsentences corresponding to different contexts. As used in thisdisclosure, the term “cluster of extracted sentences” refers to agrouping of extracted sentences that correspond to a linguistic context.In particular, in some embodiments, a cluster of extracted sentencesrefers to a grouping of extracted sentences that correspond to alinguistic context embedding. In some cases, a cluster of extractedsentences further corresponds to a range of (or specific) positive ornegative sentiment scores.

Relatedly, the term “context” refers to a set of linguisticcharacteristics surrounding a word, phrase, or sentence. In particular,in some embodiments, a context refers to a linguistic context embeddingfor a word, phrase, or sentence. For example, in some embodiments, alinguistic context embedding includes a word embedding or a sentenceembedding for an extracted sentence. When a grouping of extractedsentences most closely correspond to a particular context or to alinguistic context embedding, the grouping of extracted sentences form acluster of extracted sentences.

After generating clusters of extracted sentences, theintelligent-text-insight system 106 selects one or more extractedsentences from each cluster or from some of the clusters. For example,in some embodiments, the intelligent-text-insight system 106 selects afirst extracted sentence from a first cluster of extracted sentences anda second extracted sentence from a second cluster of extractedsentences. As indicated by FIG. 2, the intelligent-text-insight system106 subsequently generates the response summary 210 for a set of textualresponses based on extracted sentences selected from clusters ofextracted sentences.

As used in this disclosure, the term “response summary” refers to asynopsis of a set of textual responses. In particular, a responsesummary includes a compressed paraphrasing of extracted sentences from aset of textual responses. In some embodiments, for instance, a responsesummary may paraphrase textual responses that respond to a particularsurvey question (e.g., a plain language summary of respondents' writtenresponses to a common survey question). Additionally, or alternatively,a response summary may paraphrase textual responses concerning aparticular topic (e.g., a product or service). A response summary cancome in the form of a plain language representation of an underlying setof textual responses and look itself as a textual response. By contrast,a response summary can also come in the form of an executive summarydescribing common issues in an underlying set of textual responses.

As further shown in FIG. 2, in some embodiments, theintelligent-text-insight system 106 applies the text-quality classifier208 and computational sentiment analysis to selectrepresentative-textual responses 212 from a set of textual responses.Similar to the initial selection process above, after receiving thetextual responses 202 a-202 n, the intelligent-text-insight system 106selects from the textual responses 202 a-202 n a set of textualresponses to provide to the text-quality classifier 208. For instance,the intelligent-text-insight system 106 may select a set of textualresponses responding to a digital survey based on one or both of arelevance of textual responses to a query and recency of response. Thetext-quality classifier 208 subsequently generates a textual qualityscore for each textual response from the set of textual responses.

As used in this disclosure, the term “text-quality classifier” refers toa machine-learning classifier that classifies textual responses bydetermining a score indicating a quality of the writing of the textualresponse. In particular, in certain implementations, a text-qualityclassifier refers to a decision-tree classifier or a neural-networkclassifier that generates scores classifying a quality of a textualresponse's writing based on textual feature dimensions for (orcorresponding to) the textual response. For example, a text-qualityclassifier may include a linear regression, multi-layered perceptron,random-forest classifier, or support vector machine (“SVM”). Bycontrast, a text-quality classifier may include a CNN or RNN.

Relatedly, the term “textual quality score” refers to a score indicatinga quality of writing in a textual response based on linguistic ortextual features of the textual response. In particular, in someembodiments, a textual quality score refers to a score indicating aquality of a textual response's writing based on textual featuresextracted from the textual response. Such textual features may reflect,but are not limited to, readability indices, response composition,speech composition, coherence, point-of-view identification, andsuggestion identification of a textual response.

In addition to generating a textual quality score, theintelligent-text-insight system 106 provides each textual response tothe text-analysis engine 206 to determine a relevancy parameter, asentiment indicator, and a topic for each textual response from the setof textual responses. As used in this disclosure, the term “relevancyparameter” refers to a parameter indicating a relevance of a textualresponse to a query. For instance, a relevancy parameter may refer to atopic-similarity score indicating a relevance of a textual response to atopic query. Such a topic-similarity score may be, for instance, aLucene search-engine score based on search terms in a topic query, suchas a score determined by a function of a normalized termfrequency-inverse document frequency (“TF-IDF”). As a further example, atopic-similarity score may include a similarity measure of a vectorrepresentation of a text of the textual response and a topic query,where the similarity measure is based on various similarity measures,such as cosine similarity.

Additionally, or alternatively, a relevancy parameter may refer to (orinclude) a recency of a textual response according to a time at whichthe textual response was sent or received. As a further example, arelevancy parameter may refer to (or include) a similarity between atext of a textual response and a corresponding question prompting thetextual response. In some embodiments, a relevancy parameter includesany combination of a topic-similarity score, a recency of a textualresponse, and a similarity between a textual response and a promptingquestion.

Relatedly, the term “topic” refers to a subject matter of a textualresponse. In particular, in some embodiments, a topic includes a topicidentifier or tag indicating a subject matter of a textual responseapplied to the textual response by a human reviewer or by a computer(e.g., based on a topic of a survey question or based on naturallanguage analysis). In some such embodiments, a computer automaticallyapplies a topic tag or topic identifier to a textual response orconstituent sentence based on frequency of a term or a synonym for aterm within a textual response or within a particular sentence.

After determining sentiment indicators and topics, theintelligent-text-insight system 106 generates response groups of textualresponses from the set of textual responses based on one or both of thesentiment indicator and the topic for each textual response. As used inthis disclosure, the term “response group” refers to a grouping oftextual responses at least according to sentiment indicators. Inparticular, in some embodiments, the term response group refers to agrouping of textual responses by one or both of (i) sentiment indicatoror range of sentiment scores and (ii) topic. Accordingly, in someembodiments, a response group of textual response includes textualresponses corresponding to a particular range of sentiment scores orcorresponding to a positive sentiment indicator, a negative sentimentindicator, a neutral sentiment indicator, or a mixed sentimentindicator. Such a response group of textual responses may further besegmented according to topic.

As indicated by FIG. 2, in addition to generating response groups oftextual responses, the intelligent-text-insight system 106 selects therepresentative-textual responses 212 from such response groups fordisplay on a client device. In some cases, the intelligent-text-insightsystem 106 selects a representative-textual response of therepresentative-textual responses 212 based in part on the relevancyparameter for each textual response within a response group. As usedherein, the term “representative-textual response” refers to a textualresponse indicative or representative of a subset of textual responses.In particular, in some embodiments, the term representative-textualresponse refers to a textual response expressing concepts or issuescommonly found in a response group of textual responses. For instance, arepresentative-textual response may capture a commonly expressedcomplaint, praise, or other concept in a response group corresponding toa positive, mixed, neutral, or negative sentiment.

As further indicated in FIG. 2, in some embodiments, theintelligent-text-insight system 106 generates the response summary 210and the representative-textual responses 212 for a same set of textualresponses. By providing both the response summary 210 and therepresentative-textual responses 212, the intelligent-text-insightsystem 106 can provide two different types of concise snapshots of a setof textual responses within a single user interface.

As suggested above, in some embodiments, the intelligent-text-insightsystem 106 trains one or both of an extractor-selection network and anabstractor network of the response-extraction-neural network 204. FIGS.3A-3C depict examples of such training. As an overview, FIG. 3Aillustrates the intelligent-text-insight system 106 training anextractor-selection network 304 of the response-extraction-neuralnetwork 204 to extract selected sentences. FIG. 3B illustratesintelligent-text-insight system 106 training an abstractor network 316of the response-extraction-neural network 204 to paraphrase selectedsentences. FIG. 3C illustrates the intelligent-text-insight system 106using reinforced learning to train the extractor-selection network 304and the abstractor network 316 with a critic network 330 to paraphraseselected sentences.

As shown in FIG. 3A, the intelligent-text-insight system 106 iterativelyprovides textual-training responses to the extractor-selection network304 to extract selected-training sentences from correspondingtextual-training responses. In each training iteration, theintelligent-text-insight system 106 determines a loss from a lossfunction 308 based on a comparison of a selected-training sentence and asimilar-response sentence, where the similar-response sentencecorresponds to a ground-truth-paraphrased sentence. Theintelligent-text-insight system 106 subsequently adjusts networkparameters of the extractor-selection network 304 based on thedetermined loss. The intelligent-text-insight system 106 then inputs afurther textual-training response into the extractor-selection network304 as part of a subsequent training iteration.

In an initial training iteration, for example, theintelligent-text-insight system 106 inputs a textual-training response302 into the extractor-selection network 304. As part of such input, insome embodiments, the intelligent-text-insight system 106 parses andtokenizes the textual-training response 302 into a list of responsesentences. The intelligent-text-insight system 106 subsequently inputsthe list of response sentences (or each response sentence) from thetextual-training response 302 into the extractor-selection network 304.

As used in this disclosure, the term “textual-training response” refersto a textual response used for training a response-extraction-neuralnetwork. In some embodiments, a textual-training response includes anemail, a digital or online posting, a text message, or other digitalmessage. By contrast, in certain implementations, a textual-trainingresponse includes published or publicly available written content, suchas a news article, academic article, or industry publicationcorresponding to (or including) a written abstract or summary of thearticle or publication.

The term “extractor-selection network” refers to an artificial neuralnetwork that is part of (or constitutes) a response-extraction-neuralnetwork and that extracts selected sentences from textual responses. Insome embodiments, an extractor-selection network comprises one or moreencoders and one or more decoders, such as an encoder-decoder-neuralnetwork or encoders and decoders part of (or used in conjunction with)an RNN. For example, an extractor-selection network may include aconvolutional sentence encoder, an RNN that acts as an encoder, and anRNN that acts as a decoder. In some such embodiments, theextractor-selection network comprises the convolutional sentence encoderand long-short-term memory (“LSTM”) layered RNNs of an extractor agentdescribed by Yen-Chun Chen and Mohit Bansal, “Fast AbstractiveSummarization with Reinforce-Selected Sentence Rewriting,” Proceedingsof the 56th Annual Meeting of the Association for ComputationalLinguistics, pp. 675-686 (2018) (hereinafter “Chen”), the entirecontents of which are hereby incorporated by reference.

Upon receiving the textual-training response 302, theextractor-selection network 304 extracts a selected-training sentence306 from the textual-training response 302. The term “selected-trainingsentence” refers to a response sentence selected and extracted from atextual response during training of a response-extraction-neuralnetwork. In some embodiments, for instance, the extractor-selectionnetwork 304 determines an encoded representation and a context-awarerepresentation for each sentence within the textual-training response302. Based on the representation and context-aware representation foreach sentence, the extractor-selection network 304 selects and extractsthe selected-training sentence 306 from the textual-training response302. In some such cases, a selected-training sentence, encodedrepresentation, and context-aware representation respectively correspondto the symbols j_(t), r_(j), and h_(j) described in Chen.

As further indicated in FIG. 3A, the intelligent-text-insight system 106determines a loss from the loss function 308 based on a comparison ofthe selected-training sentence 306 and a similar-response sentence 310.As used in this disclosure, the term “similar-response sentence” refersto a response sentence from a textual-training response that is similarto a ground-truth-paraphrased sentence. In particular, in someembodiments, a similar-response sentence refers to a response sentencefrom a textual-training response that is most similar to aground-truth-paraphrased sentence from a ground-truth summary. In suchcases, the intelligent-text-insight system 106 identifies a most similarresponse sentence based on individual sentence scores for each sentencein the textual-training response and the ground-truth summary. Forinstance, the intelligent-text-insight system 106 may determine scoresfor each sentence in textual-training responses—and for each sentence inground-truth summaries of the textual-training responses—as a basis foridentifying a similar-response sentence. In some cases, asimilar-response sentence represents the most similar document sentenced_(jt), as described in Chen.

As just indicated, the concept of a similar-response sentence relates toa ground-truth summary and a ground-truth-paraphrased sentence. The term“ground-truth summary” refers to an empirically observed orhuman-written abstract or summary of a textual-training response. Forexample, a ground-truth summary may include a summary of a writtensurvey response prepared by a human or analyst. As a further example, aground-truth summary may include a written abstract for a news article,academic article, or industry publication. The term“ground-truth-paraphrased sentence” refers to a sentence from aground-truth summary. Continuing the examples from above, aground-truth-paraphrased sentence may be a sentence from a writtensummary of a survey response or a sentence from a written abstract foran article. When training the extractor-selection network 304, in someembodiments, the intelligent-text-insight system 106 uses aground-truth-paraphrased sentence as a reference point for asimilar-response sentence, the latter of which theintelligent-text-insight system 106 compares to a selected-trainingsentence in the loss function 308.

As indicated by FIG. 3A, the intelligent-text-insight system 106 may usea cross-entropy-loss function to compare the selected-training sentence306 and the similar-response sentence 310 and to determine a loss. Inthe alternative to a cross-entropy-loss function, in some embodiments,the intelligent-text-insight system 106 uses an L2-loss function,mean-absolute-error-loss function, a mean-squared-error-loss function, aroot-mean-squared-error function, or other suitable loss function as theloss function 308.

Upon determining a loss from the loss function 308, theintelligent-text-insight system 106 adjusts network parameters (e.g.,weights or values) of the extractor-selection network 304 to decrease aloss for the loss function 308 in a subsequent training iteration. Forexample, the intelligent-text-insight system 106 may increase ordecrease weights or values of the extractor-selection network 304 tominimize the loss in a subsequent training iteration.

As suggested by FIG. 3A, after adjusting network parameters of theextractor-selection network 304 for an initial training iteration, theintelligent-text-insight system 106 performs additional trainingiterations until satisfying a convergence criteria. For instance, theintelligent-text-insight system 106 iteratively providestextual-training responses to the extractor-selection network 304 toextract selected-training sentences, iteratively determines losses fromthe loss function 308 based on comparisons of the selected-trainingsentences and similar-response sentences, and iteratively adjustsnetwork parameters of the extractor-selection network 304 based on thedetermined losses. In some cases, the intelligent-text-insight system106 performs training iterations until the value or weights of theextractor-selection network 304 do not change significantly acrosstraining iterations.

Turning back now to FIG. 3B, as shown in this figure, theintelligent-text-insight system 106 iteratively providesselected-training sentences to the abstractor network 316 to generateparaphrased-training sentences. In each training iteration, theintelligent-text-insight system 106 further determines a loss from aloss function 320 based on a comparison of a paraphrased-trainingsentence and a ground-truth paraphrased sentence, where the ground-truthparaphrased sentence comes from a ground-truth summary. Theintelligent-text-insight system 106 subsequently adjusts networkparameters of the abstractor network 316 based on the determined loss.The intelligent-text-insight system 106 then inputs a furtherselected-training sentence into the abstractor network 316 as part of asubsequent training iteration.

In an initial iteration, for example, the intelligent-text-insightsystem 106 inputs a selected-training sentence 314 into the abstractornetwork 316. In some embodiments, the selected-training sentence 314 isthe selected-training sentence 306 output by the extractor-selectionnetwork 304. As used in this disclosure, the term “abstractor network”refers to an artificial neural network that paraphrases sentencesselected from textual responses. In some embodiments, an abstractornetwork comprises an encoder and a decoder, such as anencoder-decoder-neural network or a bidirectional RNN comprising an RNNencoder and an RNN decoder. For example, an abstractor network mayinclude an RNN encoder and an RNN decoder with a feedforward neuralnetwork that aligns sentences from the RNN encoder to the RNN decoder.In some such embodiments, the abstractor network comprises theencoder-aligner-decoder described by Chen or the architecture of thebidirectional RNN and alignment model described by Dzmitry Bandanau etal., “Neural Machine translation by jointly Learning to Align andTranslate,” International Conference on Learning Representations (2015)(hereinafter “Bandanau”), the entire contents of which are herebyincorporated by reference.

Upon receiving the selected-training sentence 314, the abstractornetwork 316 compresses and paraphrases the selected-training sentence togenerate a paraphrased-training sentence 318. The term“paraphrased-training sentence” refers to a response sentence compressedor summarized by a response-extraction-neural network. In particular, insome cases, a paraphrased-training sentence represents a summarizedversion of a sentence extracted from a textual response.

As further shown in FIG. 3B, the intelligent-text-insight system 106determines a loss from the loss function 320 based on a comparison ofthe paraphrased-training sentence 318 and a ground-truth-paraphrasedsentence 322. For example, the intelligent-text-insight system 106 mayuse a cross-entropy-loss function to compare the paraphrased-trainingsentence 318 and the ground-truth-paraphrased sentence 322 and todetermine a loss. In the alternative to a cross-entropy-loss function,in some embodiments, the intelligent-text-insight system 106 uses anL2-loss function, mean-absolute-error-loss function, amean-squared-error-loss function, or other suitable loss function as theloss function 320.

Upon determining a loss from the loss function 320, theintelligent-text-insight system 106 adjusts network parameters (e.g.,weights or values) of the abstractor network 316 to decrease a loss forthe loss function 320 in a subsequent training iteration. For example,the intelligent-text-insight system 106 may increase or decrease weightsor values of the abstractor network 316 to decrease the loss in asubsequent training iteration.

As suggested by FIG. 3B, after adjusting network parameters of theabstractor network 316 for an initial training iteration, theintelligent-text-insight system 106 performs additional trainingiterations until satisfying a convergence criteria. For instance, theintelligent-text-insight system 106 iteratively providesselected-training sentences to the abstractor network 316 to generateparaphrased-training sentences, iteratively determines losses from theloss function 320 based on comparisons of the paraphrased-trainingsentences and ground-truth-paraphrased sentences, and iterativelyadjusts network parameters of the abstractor network 316 based on thedetermined losses. In some cases, the intelligent-text-insight system106 performs training iterations until the value or weights of theabstractor network 316 do not change significantly across trainingiterations.

As noted above, the intelligent-text-insight system 106 optionallytrains the extractor-selection network 304 and the abstractor network316 together using reinforced learning. FIG. 3C illustrates an exampleof one such reinforced learning model. In particular, FIG. 3C depictsthe intelligent-text-insight system 106 using the critic network 330 totrain the extractor-selection network 304 and the abstractor network 316within the response-extraction-neural network to paraphrase selectedsentences.

In an initial iteration, for example, the intelligent-text-insightsystem 106 inputs a textual-training response 324 into theextractor-selection network 304 to generate a selected-training sentence326 as an input for the abstractor network 316. Upon receipt, theabstractor network 316 subsequently generates a paraphrased-trainingsentence 328 for the selected-training sentence 326. Theintelligent-text-insight system 106 further inputs theparaphrased-training sentence 328 and a ground-truth-paraphrasedsentence 332 into the critic network 330. The critic network 330subsequently generates a match score 334 indicating a degree to whichthe paraphrased-training sentence 328 matches or resembles theground-truth-paraphrased sentence 332. The intelligent-text-insightsystem 106 further adjusts network parameters of one or both of theextractor-selection network 304 and the abstractor network 316 based onthe match score 334.

As used in this disclosure, the term “critic network” refers to amachine learner that determines or evaluates a degree to which aparaphrased-training sentence resembles a ground-truth-paraphrasedsentence. In some embodiments, a critic network constitutes an ActorAdvantage Critic (“A2C”) model that implements a policy gradient. Toimplement such a policy gradient in a given training iteration, thecritic network 330 optionally observes a current state c_(t) of atextual-training response and its constituent sentences, samples aparaphrased-training sentence generated from a selected-trainingsentence d_(jt), and generates a Recall-Oriented Understudy for GistingEvaluation (“ROUGE”) match score comparing the paraphrased-trainingsentence to a ground-truth-paraphrased sentence. In some cases, theintelligent-text-insight system 106 uses the critic network andROUGE-match score described in Chen.

By employing a reinforced learning model, in some embodiments, thecritic network 330 uses a match score as a reward. For example, if theextractor-selection network 304 extracts a selected-training sentencerelevant to a ground-truth summary—and the abstractor network 316compresses the relevant selected-training sentence into aparaphrased-training sentence—the critic network 330 generates aROUGE-match score as a reward indicating a relatively high degree ofmatch between the paraphrased-training sentence and aground-truth-paraphrased sentence. By contrast, if theextractor-selection network 304 extracts a selected-training sentenceirrelevant to the ground-truth summary—and the abstractor network 316compresses the irrelevant selected-training sentence into aparaphrased-training sentence—the critic network 330 generates aROUGE-match score as a deterrent indicating a lower degree of matchbetween the paraphrased-training sentence and a ground-truth-paraphrasedsentence.

Continuing with the initial training iteration in FIG. 3C, theintelligent-text-insight system 106 uses the critic network 330 todetermine a loss based on a comparison of the paraphrased-trainingsentence 328 and the ground-truth-paraphrased sentence 332. For example,in some embodiments, the critic network 330 determines a squared lossbetween a baseline critic and a total return of expected rewards interms of ROUGE-match scores. This total return represents an estimate ofrewards from an action-value function (e.g., maximum expectedROUGE-match scores) for extracting selected-training sentences from eachobserved state (e.g., from each textual-training response). In theinitial training iteration, the intelligent-text-insight system 106adjusts parameters of the critic network 330 to decrease a squared lossin a subsequent training iteration.

As suggested by FIG. 3C, after adjusting network parameters of theresponse-extraction-neural network 204 and the critic network 330 for aninitial training iteration, the intelligent-text-insight system 106performs additional training iterations until satisfying a convergencecriteria. For instance, the intelligent-text-insight system 106iteratively provides textual-training responses to theresponse-extraction-neural network 204 to extract selected-trainingsentences and generate paraphrased-training sentences and iterativelydetermines match scores based on comparisons of the paraphrased-trainingsentences and ground-truth-paraphrased sentences. Based on the matchscores, the intelligent-text-insight system 106 adjusts networkparameters of the response-extraction-neural network 204 and the criticnetwork 330. By performing multiple training iterations, theintelligent-text-insight system 106 trains the critic network 330 tominimize a squared loss.

In addition (or in the alternative) to training theresponse-extraction-neural network 204, in some embodiments, theintelligent-text-insight system 106 applies theresponse-extraction-neural network 204 and computational sentimentanalysis to generate a response summary for a set of textual responses.FIG. 3D illustrates one such embodiment of the intelligent-text-insightsystem 106.

As shown in FIG. 3D, the intelligent-text-insight system 106 providestextual responses 336 a-336 n to the response-extraction-neural network204 to respectively generate extracted sentences 340 a-340 n for thetextual responses 336 a-336 n. The intelligent-text-insight system 106further determines sentiment indicators 344 a-344 n respectively for theextracted sentences 340 a-340 n as a basis for sorting the extractedsentences 340 a-340 n. Based on a sorted set of the extracted sentences346, the intelligent-text-insight system 106 generates clusters ofextracted sentences 348 a-348 n. Based on extracted sentences 352 a, 352b, and 352 n from the clusters of extracted sentences 348 a, 348 b, and348 n, respectively, the intelligent-text-insight system 106 generates aresponse summary 356 for the textual responses 336 a-336 n.

As indicated in FIG. 3D, in some embodiments, theintelligent-text-insight system 106 responds to a query by selecting thetextual responses 336 a-336 n as a set of textual responses. In somecases, the intelligent-text-insight system 106 can select the textualresponses 336 a-336 n based on a query searching for textual responsescorresponding to a topic or time period. For example, in someembodiments, the intelligent-text-insight system 106 selects the textualresponses 336 a-336 n from a response database based on each textualresponse's relevance to a topic query and recency in receipt. Such arelevance may be determined by a Lucene search-engine score based onsearch terms in a topic query, such as a score determined by a functionof a normalized TF-IDF. As a further example, in some embodiments, theintelligent-text-insight system 106 selects the textual responses 336a-336 n from the response database based on a time identifier for eachtextual response indicating that the intelligent-text-insight system 106received (or a recipient device sent) the textual response within aqueried time period.

After selecting the textual responses 336 a-336 n, theintelligent-text-insight system 106 iteratively provides the textualresponses 336 a-336 n to the response-extraction-neural network 204 torespectively generate the extracted sentences 340 a-340 n for thetextual responses 336 a-336 n. For example, in some embodiments, theintelligent-text-insight system 106 parses and tokenizes the textualresponse 336 a into a list of response sentences. Theintelligent-text-insight system 106 subsequently inputs the list ofresponse sentences (or each response sentence) from the textual response336 a into the extractor-selection network 304. Similarly, theintelligent-text-insight system 106 parses and tokenizes each of theremaining textual responses 336 b-336 n into lists of response sentencesfor input into the extractor-selection network 304.

Consistent with the disclosure above, in some embodiments, theintelligent-text-insight system 106 iteratively provides the textualresponses 336 a-336 n to the extractor-selection network 304 torespectively generate selected sentences 338 a-338 n. In some cases, theselected sentences 338 a-338 n are not paraphrased, but become theextracted sentences 340 a-340 n. By contrast, in some embodiments, theintelligent-text-insight system 106 iteratively provides the selectedsentences 338 a-338 n to the abstractor network 316 to generate theextracted sentences 340 a-340 n in paraphrased form.

In addition to (or instead of) applying the response-extraction-neuralnetwork 204 to extract sentences from the textual responses 336 a-336 n,in some embodiments, the intelligent-text-insight system 106 selectssentences from one or more of the textual responses 336 a-336 n based ontopic identifiers or tags associated with such sentences. For example,the intelligent-text-insight system 106 optionally identifies anysentence from the textual responses 336 a-336 n having a topicidentifier or tag that matches a queried topic and selects suchsentences for inclusion within the selected sentences 338 a-338 n.

As further shown in FIG. 3D, in addition to generating the extractedsentences 340 a-340 n, the intelligent-text-insight system 106 inputsthe extracted sentences 340 a-340 n into a text-analysis engine 342.Among other things, the text-analysis engine 342 performs computationalsentiment analysis to determine the sentiment indicators 344 a-344 n forthe extracted sentences 340 a-340 n, respectively. Such computationalsentiment analysis can include lexicon-based sentiment analysis (alsoknown as knowledge-based sentiment analysis) or statistical-basedsentiment analysis to determine a sentiment score or a sentiment labelfor an extracted sentence (e.g., a sentiment score between —10 and 10).As suggested above, in some embodiments, the text-analysis engine 342determines a sentiment label of positive, negative, neutral, or mixedfor each of the extracted sentences 340 a-340 n based on a correspondingsentiment score.

To perform computational sentiment analysis, theintelligent-text-insight system 106 optionally uses a Pattern Analyzeror a Natural Language Tool Kit (“NLTK”) classifier from TextBlob todetermine a sentiment score for each extracted sentence. As a furtherexample, in some embodiments, the intelligent-text-insight system 106determines sentiment scores as described by Maite Taboada et al.,“Lexicon-Based Methods for Sentiment Analysis,” 37 ComputationalLinguistics 272-274 (2011), the entire contents of which are herebyincorporated by reference.

As suggested above, in some embodiments, the text-analysis engine 342determines the sentiment indicators 344 a-344 n for specific topics. Forexample, upon receiving a query searching for textual responses bytopic, the intelligent-text-insight system 106 determines a sentimentindicator for each of the extracted sentences 340 a-340 n according tothe queried topic. Accordingly, the text-analysis engine 342 maydetermine multiple sentiment indicators for a single extracted sentencecorresponding to different topics.

As further shown in FIG. 3D, the intelligent-text-insight system 106sorts the extracted sentences 340 a-340 n into a sorted set of extractedsentences 346 based on the sentiment indicators 344 a-344 n. Whengenerating the sorted set of extracted sentences 346, theintelligent-text-insight system 106 optionally selects (i) one or moresubsets of extracted sentences corresponding to sentiment scoressatisfying a positive-sentiment-score threshold for positive sentimentsand (ii) one or more subsets of extracted sentences corresponding tosentiment scores satisfying a negative-sentiment-score threshold fornegative sentiments. In some cases, the intelligent-text-insight system106 further selects one or more subsets of extracted sentencescorresponding to sentiment scores satisfying a medium-sentiment-scorethreshold.

Sentiment-score thresholds may be relative or fixed with respect tosentiment scores. For example, a positive-sentiment-score threshold mayinclude a certain number or percentage of highest sentiment scores(e.g., extracted sentences corresponding to the highest 150 sentimentscores). A negative-sentiment-score threshold may include a certainnumber or percentage of lowest sentiment scores (e.g., extractedsentences corresponding to the lowest 150 sentiment scores). Amedium-sentiment-score threshold may include a certain number orpercentage of sentiment scores within the median of a distribution(e.g., extracted sentences corresponding to the 150 sentiment scoresclosest to a median). As a further example, a positive-sentiment-scorethreshold, a negative-sentiment-score threshold, and amedium-sentiment-score threshold may include sentiment scores above athreshold sentiment score (e.g., above +5), below a threshold sentimentscore (e.g., above −5), and within a particular range of sentiment score(e.g., within −2 and +2), respectively.

To illustrate sorting the extracted sentences 340 a-340 n, in someimplementations, the intelligent-text-insight system 106 groups theextracted sentences 340 a-340 n into (i) a positive segment comprisingextracted sentences corresponding to sentiment scores indicatingpositive sentiments and (ii) a negative segment comprising extractedsentences corresponding to sentiment scores indicating negativesentiments. The intelligent-text-insight system 106 further selects,from the positive segment, a subset of extracted sentences satisfying apositive-sentiment-score threshold. Similarly, theintelligent-text-insight system 106 selects, from the negative segment,a subset of extracted sentences satisfying a negative-sentiment-scorethreshold. In such embodiments, the sorted set of extracted sentences346 would include at least a subset of extracted sentences satisfying apositive-sentiment-score threshold and at least a subset of extractedsentences satisfying a negative-sentiment-score threshold.

As further shown in FIG. 3D, the intelligent-text-insight system 106generates the clusters of extracted sentences 348 a-348 n respectivelycorresponding to linguistic contexts 350 a-350 n. To create the clustersof extracted sentences 348 a-348 n, the intelligent-text-insight system106 optionally matches extracted sentences from the sorted set ofextracted sentences 346 to the linguistic contexts 350 a-350 n. Whenperforming such matching, the intelligent-text-insight system 106determines that extracted sentences relate to (or are surrounded by) aword, phrase, or sentence (or synonyms thereof) capturing the linguisticcontexts 350 a-350 n. Accordingly, the linguistic contexts 350 a-350 nmay be word embeddings, phrase embeddings, or sentence embeddings fromcomputational linguistics. In some cases, the clusters of extractedsentences 348 a-348 n further include extracted sentences that eachcorrespond to a positive, a negative, a neutral, or a mixed sentimentindicator (e.g., a sentiment score).

To create the clusters of extracted sentences 348 a-348 n, in certainimplementations, the intelligent-text-insight system 106 in particularapplies a smooth-inverse-frequency algorithm. For instance, theintelligent-text-insight system 106 applies a smooth-inverse-frequencyalgorithm to each subset of extracted sentences from the sorted set ofextracted sentences 346 to generate linguistic-context embeddings, whereeach of the linguistic contexts 350 a-350 n are linguistic-contextembeddings. The intelligent-text-insight system 106 further determinesan optimal number of clusters by, for example, applying the elbow methodfor K-means clustering, X-means clustering, Akaike informationcriterion, Bayesian information criterion, or another suitable method ofdetermining a number of clusters. The intelligent-text-insight system106 subsequently generates the clusters of extracted sentences 348 a-348n by identifying extracted sentences corresponding to each of thelinguistic-context embeddings. In some such cases, theintelligent-text-insight system 106 sets the number of vector dimensionsto 128 for a smooth-inverse-frequency embedding. By applying asmooth-inverse-frequency algorithm, the intelligent-text-insight system106 generates clusters of extracted sentences with a diverse set oflinguistic vectors.

To apply the smooth-inverse-frequency algorithm, theintelligent-text-insight system 106 optionally lemmatizes word tokensassigned to each extracted sentence from the sorted set of extractedsentences 346, determines word probabilities among the extractedsentences, and determines weighted averages of those words. In some suchcases, the intelligent-text-insight system 106 uses thesmooth-inverse-frequency algorithm described by Sanjeev Arora et al., “ASimple but Tough-to-Beat Baseline for Sentence Embeddings,”International Conference on Learning Representations (2017), the entirecontents of which are hereby incorporated by reference.

Although the size of each cluster in FIG. 3D appears the same, the sizeof such clusters may vary. Indeed, a size of each of the clusters ofextracted sentences 348 a-348 n may approximate the contribution orrepresentation of each cluster to the response summary 356. Accordingly,a cluster of extracted sentences corresponding to highly positivesentiment scores or highly negative sentiment scores may result in aresponse summary of highly positive sentiment or highly negativesentiment.

As further shown in FIG. 3D, the intelligent-text-insight system 106selects the extracted sentences 352 a, 352 b, and 352 n from theclusters of extracted sentences 348 a, 348 b, and 348 n, respectively.For instance, the intelligent-text-insight system 106 selects theextracted sentences 352 a, 352 b, and 352 n from the clusters ofextracted sentences 348 a, 348 b, and 348 n, based on a relevance of theextracted sentences 352 a, 352 b, and 352 n to the linguistic contexts350 a, 350 b, and 350 n, respectively. The intelligent-text-insightsystem 106 may select the extracted sentence 352 a, for instance, as themost relevant or similar to the linguistic context 350 a. As indicatedin FIG. 3D, in certain implementations, the intelligent-text-insightsystem 106 selects multiple extracted sentences from each of theclusters of extracted sentences 348 a-348 n as a basis for (or forinclusion within) the response summary 356. Accordingly, theintelligent-text-insight system 106 optionally (and additionally)selects extracted sentences 352 c, 352 d, and 352 e from the clusters ofextracted sentences 348 a, 348 b, and 348 n, respectively.

To illustrate the selection process, in some cases, theintelligent-text-insight system 106 selects the extracted sentences 352a, 352 b, and 352 n from the clusters of extracted sentences 348 a, 348b, and 348 n based on a maximum marginal relevance (“MMR”) of theextracted sentences 352 a, 352 b, and 352 n to the linguistic contexts350 a, 350 b, 350 n, respectively. For instance, theintelligent-text-insight system 106 applies an MMR algorithm to selectthe extracted sentence 352 a from the cluster of extracted sentences 348a as the nearest neighbor to a linguistic-context embedding for thecluster of extracted sentences 348 a, where the linguistic-contextembedding constitutes the linguistic context 350 a. In some suchembodiments, the intelligent-text-insight system 106 applies an MMRalgorithm to select multiple extracted sentences from each of theclusters of extracted sentences 348 a-348 n as the nearest neighbors toeach linguistic-context embedding (e.g., by selecting the two or threenearest neighbors for each linguistic-context embedding).

As further shown in FIG. 3D, the intelligent-text-insight system 106generates the response summary 356 based on the extracted sentences 352a, 352 b, and 352 n. Here, the response summary 356 represents asynopsis of the textual responses 336 a-336 n. To create arepresentative synopsis, in some embodiments, theintelligent-text-insight system 106 generates the response summary 356comprising the extracted sentences 352 a, 352 b, and 352 n or a portionof one or more of the extracted sentences 352 a, 352 b, or 352 n. Forinstance, the intelligent-text-insight system 106 may concatenate,combine, or otherwise join together the extracted sentences 352 a, 352b, and 352 n or a portion thereof to form the response summary 356.

As further indicated by FIG. 3D, in some embodiments, theintelligent-text-insight system 106 uses a summarization neural network354 to generate the response summary 356. As used in this disclosure,the term “summarization neural network” refers to an artificial neuralnetwork that compresses multiple extracted sentences selected from acluster of extracted sentences to generate a compressed sentencerepresenting a summary of the multiple extracted sentences. In somecases, a summarization neural network ranks the extracted sentences fromeach cluster of extracted sentences and compresses the ranked extractedsentences into a response summary.

In some embodiments, a summarization neural network takes the form anencoder-decoder architecture modeled by RNNs. For instance, in somecases, the summarization neural network takes the form of the sentenceencoder, document encoder, and sentence extractor described by ShashiNarayan et al., “Ranking Sentences for Extractive Summarization withReinforcement Learning,” Proceedings of North American Chapter of theAssociation for Computational Linguistics-Human Language Technologies,pp. 1747-1759 (2018), the entire contents of which are incorporated byreference.

To illustrate, in some embodiments, the intelligent-text-insight system106 inputs the extracted sentences 352 a, 352 b, 352 c, 352 d, 352 e,and 352 n into the summarization neural network 354. In some cases, theextracted sentences 352 a-352 n are input into the summarization neuralnetwork 354 as concatenated extracted sentences in a digital document.The summarization neural network 354 subsequently compresses theextracted sentences 352 a and 352 c into a compressed sentencerepresenting a summary of the extracted sentences 352 a and 352 c;compresses the extracted sentences 352 b and 352 d into a compressedsentence representing a summary of the extracted sentences 352 b and 352d; and compresses the extracted sentences 352 e and 352 n into acompressed sentence representing a summary of the extracted sentences352 e and 352 n. The summarization neural network 354 subsequentlygenerates the response summary 356 comprising each of the compressedsentences.

In addition (or in the alternative to) generating a response summary asdepicted in FIG. 3D, in some embodiments, the intelligent-text-insightsystem 106 accounts for one or both of (i) textual responses fromverified purchasers and (ii) textual responses having a threshold amountof views (or votes) in favor or against such responses. In someembodiments, the intelligent-text-insight system 106 identifies atextual response from a verified purchaser or a textual responsesatisfying a threshold amount of views or votes in a field or valuecorresponding to the response. The intelligent-text-insight system 106can subsequently use such fields or value for a filtered summaryresponse.

For instance, in certain implementations, the intelligent-text-insightsystem 106 inputs textual responses from verified purchasers into theresponse-extraction-neural network 204 based on a user-selected filterfor such verified purchasers. Upon applying a verified-purchaser filter,the intelligent-text-insight system 106 generates a response summary fortextual responses from verified purchasers. Similarly, in someembodiments, the intelligent-text-insight system 106 filters textualresponses by a threshold amount of views or votes based on auser-selected filter for views or votes and inputs such filtered textualresponses into the response-extraction-neural network 204 (e.g., byapplying a filter for the top ten, fifty, or one hundred textualresponses receiving the most views or votes, where votes can be in favoror against the response as selected by the user). Upon applying a viewfilter or a vote filter, the intelligent-text-insight system 106generates a response summary for textual responses satisfying athreshold amount of views or votes.

As noted above, in some embodiments, the intelligent-text-insight system106 receives a query from a client device searching for a set of textualresponses and subsequently provides a response summary for the set oftextual responses to the client device for display within a graphicaluser interface. In some cases, the intelligent-text-insight system 106further generates a summary comparison of different response summariescorresponding to different time periods to indicate changes over time intextual responses. FIGS. 4A-4B illustrate examples of such graphicaluser interfaces. As suggested above, the administrator application 110optionally includes computer-executable instructions that cause theadministrator device 108 to present the graphical user interfacesdepicted in FIGS. 4A-4B.

As shown in FIG. 4A, for instance, the administrator device 108 presentsa response summary 414 a within a graphical user interface 404 acorresponding to a screen 402. As indicated by various query-parameteroptions within the graphical user interface 404 a, theintelligent-text-insight system 106 receives a query from theadministrator device 108 requesting a response summary or searching fortextual responses responding to a survey question. In response toreceiving the query, the intelligent-text-insight system 106 providesthe response summary 414 a for the queried textual responses to theadministrator device 108 for display within the graphical user interface404 a.

As further shown in FIG. 4A, the intelligent-text-insight system 106provides various query-parameter options within the graphical userinterface 404 a for (i) the administrator 112 to select in constructinga query or (ii) the administrator device 108 to use as defaultparameters in a query. For example, the graphical user interface 404 aincludes a message-prompt indicator 406 indicating digital content towhich textual responses respond and for which theintelligent-text-insight system 106 identifies textual responses in aquery. While the message-prompt indicator 406 in FIG. 4A indicates asurvey question as a parameter for the query, the message-promptindicator 406 may indicate an email, a digital or online posting, a textmessage, or other digital message to which the intelligent-text-insightsystem 106 receives textual responses.

As a further example of query-parameter options, the graphical userinterface 404 a includes a time-period selector 408 a for indicating atime period corresponding to textual responses of interest to theadministrator 112. For instance, upon detecting a user selection withinthe time-period selector 408 a, the administrator device 108 presents aselected time period within which the digital survey system 104 received(or recipient devices sent) textual responses—as a parameter for thequery.

As further shown in FIG. 4A, the graphical user interface 404 a alsoincludes a selectable-topic option 410 a that, upon user selection or bydefault, indicates a topic for textual responses as a parameter for thequery. As indicated by FIG. 4A, the selectable-topic option 410 aindicates “All Topics” by default. But the administrator device 108 maychange a topic designated upon detecting a user selection by theadministrator 112. In addition to this query-parameter option, thegraphical user interface 404 a includes a search field 416 within whichthe administrator 112 can enter terms (and the administrator device 108detects terms) to search for digital messages, time periods, topics, orother parameters for a query.

As indicated by the query-parameter options in FIG. 4A, theintelligent-text-insight system 106 receives a query from theadministrator device 108 requesting a response summary (and/orsearching) for a set of textual responses defined by the user- ordefault-selections in the message-prompt indicator 406, the time-periodselector 408 a, and the selectable-topic option 410 a. Based onreceiving the query, the intelligent-text-insight system 106 identifiesa set of textual responses and generates the response summary 414 a forthe set of textual responses. The response summary 414 a includes aplain-language synopsis of textual responses to a digital messagecorresponding to the selected topic and time period in the graphicaluser interface 404 a. The response summary 414 a further includes asummary caption 412 a indicating the selected topic and time period.

Turning back now to FIG. 4B, this figure illustrates the administratordevice 108 presenting a different response summary corresponding to adifferent topic and a different time period. In particular, FIG. 4Bdepicts the administrator device 108 presenting a response summary 414 bwithin a graphical user interface 404 b corresponding to the screen 402.In this example, the intelligent-text-insight system 106 receives aquery from the administrator device 108 requesting a response summary(and/or searching) for a set of textual responses defined by the user-or default-selections in the message-prompt indicator 406, a time-periodselector 408 b, and a selectable-topic option 410 b. Based on receivingthe query, the intelligent-text-insight system 106 identifies a set oftextual responses and generates the response summary 414 b for the setof textual responses.

As shown in FIG. 4B, the response summary 414 b includes a summarycaption 412 b indicating a different selected topic and differentselected time period than the response summary 414 a shown in FIG. 4A.To identify recent changes in textual responses, the response summary414 b further includes a summary-modification indicator 418. Thesummary-modification indicator 418 identifies changes to a responsesummary in comparison to response summaries (or textual responses) forprevious time periods. In this example, the summary-modificationindicator 418 identifies a textual portion of the response summary 414 bthat is new or that has changed since a prior time period. While thetime frame and subject may differ in other examples, thesummary-modification indicator 418 identifies a change in textualresponses in comparison to a previous quarter in which theintelligent-text-insight system 106 received textual responses to aparticular survey question.

In some embodiments, the intelligent-text-insight system 106 furthergenerates a summary comparison of different response summariescorresponding to different time periods to indicates changes over timein textual responses. For example, upon receiving an indication of auser selection of a time-comparison option 420 from the administratordevice 108, the intelligent-text-insight system 106 generates andprovides an additional response summary for textual responsescorresponding to a different time period. In some such embodiments, forinstance, the intelligent-text-insight system 106 provides the responsesummary 414 b and an additional response summary (not shown) in aside-by-side comparison within a graphical user interface.

As suggested above, in some embodiments, the intelligent-text-insightsystem 106 trains a text-quality classifier. Theintelligent-text-insight system 106 applies the text-quality classifieras part of selecting representative-textual responses for a set oftextual responses. FIG. 5A illustrates the intelligent-text-insightsystem 106 training the text-quality classifier 208 to generatetextual-quality-training scores for textual-training responses based onground-truth-quality scores.

As shown in FIG. 5A, the intelligent-text-insight system 106 identifiestextual feature dimensions for the text-quality classifier 208 toanalyze upon receipt of textual responses. The intelligent-text-insightsystem 106 also iteratively provides textual-training responses to thetext-quality classifier 208 to generate textual-quality-training scoresfor the textual-training responses. In each training iteration, theintelligent-text-insight system 106 determines a loss from a lossfunction 508 based on a comparison of a textual-quality-training scoreand a ground-truth-quality score. The intelligent-text-insight system106 subsequently adjusts network parameters of the text-qualityclassifier 208 based on the determined loss. Theintelligent-text-insight system 106 then inputs a furthertextual-training response into the text-quality classifier 208 as partof a subsequent training iteration.

As just suggested, the intelligent-text-insight system 106 identifies(or sets) textual feature dimensions 502 for the text-quality classifier208 to analyze. The term “textual feature dimension” refers to acategory of linguistic characteristics for a word, phrase, or sentencewithin a written text. In particular, in some embodiments, a textualfeature dimension refers to a category of linguistic characteristics fora word, phrase, or sentence within (or metadata associated with) atextual response. Such textual feature dimensions can capture, but arenot limited to, readability indices, response composition, speechcomposition, coherence, point-of-view identification, and suggestionidentification of a textual response.

For example, textual feature dimensions include, but are not limited to,linguistic dimensions for average usefulness; part-of-speech tags; corepart-of-speech tags; Simple Measure of Gobbledygook “SMOG” index; totallength in characters; word count; suggestion identification; coherence;sentence count; Linsear Write Formula; average sentence length;determiners; numbers used in text; noun count; Gunning fog index;preposition count; conjunctions; Flesch Kincaid Grade; Dale Challreadability score; total adjective count or average number of adjectivesin a sentence; average syllables in a word, phrase, or sentence; totaladverb count or average number of adverbs in a sentence; total verbcount or average number of verbs in a sentence; average word length incharacters; foreign word count or average; Coleman-Liau index; automatedreadability index; use of third person; use of second person; use offirst person; proper nouns used; pronouns used; Flesch-Reading-Easescore; and interjections.

In some embodiments, the intelligent-text-insight system 106 limits thenumber and type of textual feature dimension to decreasecomputer-processing load. For example, in some cases, theintelligent-text-insight system 106 identifies the following textualfeature dimensions for the text-quality classifier 208: total length incharacters, average sentence length, use of first person, word count,Coleman-Liau index, automated readability index, Gunning fog index, andFlesch-Reading-Ease score.

After identifying or setting the textual feature dimensions 502 for thetext-quality classifier 208, the intelligent-text-insight system 106trains the text-quality classifier 208. In an initial trainingiteration, the intelligent-text-insight system 106 inputs atextual-training response 504 into the text-quality classifier 208. Asnoted above, the text-quality classifier 208 a linear regression,multi-layered perceptron, random-forest classifier, or support vectormachine (“SVM”). By contrast, the text-quality classifier 208 mayinclude a CNN or RNN.

Upon receiving the textual-training response 504, the text-qualityclassifier 208 analyzes textual features of the textual-trainingresponse 504 corresponding to the textual feature dimensions 502 andgenerates a textual-quality-training score 506. As used in thisdisclosure, the term “textual-quality-training score” refers to atextual quality score generated while training a text-qualityclassifier. Accordingly, a textual-quality-training score refers to ascore indicating a quality of writing in a textual-training responsebased on linguistic or textual features of the textual-trainingresponse. The text-quality classifier 208 classifies a textual-trainingresponse by classifying the response into a particular text-qualityscore (e.g., a score between 0.00 and 1.00).

As further indicated in FIG. 5A, the intelligent-text-insight system 106determines a loss from the loss function 508 based on a comparison ofthe textual-quality-training score 506 and a ground-truth-quality score.As used in this disclosure, the term “ground-truth-quality score” refersto an empirically observed or human-generated score indicating a qualityof writing in a textual-training response. In particular, in someembodiments, a ground-truth-quality score refers to an average scorefrom multiple people scoring a quality of a textual-training response.

To implement the loss function 508, in some embodiments, theintelligent-text-insight system 106 compares thetextual-quality-training score 506 and the ground-truth-quality score510 in a mean-absolute-error function. In the alternative to amean-absolute-error function, the intelligent-text-insight system 106uses an L2-loss function, cross-entropy-loss function, amean-squared-error-loss function, a root-mean-squared-error function, orother suitable loss function as the loss function 508. Upon determininga loss from the loss function 508, the intelligent-text-insight system106 adjusts network parameters (e.g., weights or values) of thetext-quality classifier 208 to decrease a loss for the loss function 508in a subsequent training iteration. For example, theintelligent-text-insight system 106 may increase or decrease weights orvalues of the text-quality classifier 208 to minimize the loss in asubsequent training iteration.

As suggested by FIG. 5A, after adjusting network parameters oftext-quality classifier 208 for an initial training iteration, theintelligent-text-insight system 106 performs additional trainingiterations until satisfying a convergence criteria. For instance, theintelligent-text-insight system 106 iteratively providestextual-training responses to the text-quality classifier 208 togenerate textual-quality-training scores, iteratively determines lossesfrom the loss function 508 based on comparisons of thetextual-quality-training scores and ground-truth-quality scores, anditeratively adjusts network parameters of the text-quality classifier208 based on the determined losses. In some cases, theintelligent-text-insight system 106 performs training iterations untilthe value or weights of the text-quality classifier 208 do not changesignificantly across training iterations.

In addition (or in the alternative) to training the text-qualityclassifier 208, the intelligent-text-insight system 106 applies thetext-quality classifier 208, computational sentiment analysis, and othertext-based determinations to generate representative-textual responsesfor a set of textual responses. FIG. 5B illustrates one such embodimentof the intelligent-text-insight system 106.

As shown in FIG. 5B, the intelligent-text-insight system 106 providestextual responses 512 a-512 n to the text-quality classifier 208 torespectively generate textual quality scores 516 a-516 n. For thetextual responses 512 n-512 n, the intelligent-text-insight system 106further (and respectively) determines relevancy parameters 520 a-520 n,topics 522 a-522 n, and sentiment indicators 524 a-524 n. Based on thetopics 522 a-522 n and the sentiment indicators 524 a-524 n, theintelligent-text-insight system 106 generates response groups of textualresponses 528 a-528 n. Based on the relevancy parameters correspondingto the textual responses within the first and second response groups,the intelligent-text-insight system 106 selects representative-textualresponses 532 a-532 n as representative of the textual responses 512a-512 n.

As indicated in FIG. 5B, in some embodiments, theintelligent-text-insight system 106 responds to a query by selecting thetextual responses 512 a-512 n as a set of textual responses. In somecases, the intelligent-text-insight system 106 can select the textualresponses 512 a-512 n based on a query searching for textual responsescorresponding to a topic or time period. For example, in certainimplementations, the intelligent-text-insight system 106 selects thetextual responses 512 a-512 n from a response database based on eachtextual response's relevance to a topic query and recency in receipt. Asa further example, in some embodiments, the intelligent-text-insightsystem 106 selects the textual responses 512 a-512 n from the responsedatabase based on a time identifier for each textual response indicatingthat the intelligent-text-insight system 106 received (or a recipientdevice sent) the textual response within a queried time period.

As further shown in FIG. 5B, the intelligent-text-insight system 106inputs the textual responses 512 a-512 n into the text-qualityclassifier 208. Critically, the intelligent-text-insight system 106 mayinput the textual responses 512 a-512 n into the text-quality classifier208 either before or after receiving a query from a client device (e.g.,a query by time period or topic). For instance, in some embodiments, theintelligent-text-insight system 106 inputs each of the textual responses512 a-512 n into the text-quality classifier 208 upon receiving eachtextual response. Accordingly, the intelligent-text-insight system 106may input more textual responses into the text-quality classifier 208than those depicted in FIG. 5B. Upon input, and consistent with thedisclosure above, the intelligent-text-insight system 106 uses thetext-quality classifier 208 to respectively generate textual qualityscores 516 a-516 n for the textual responses 512 a-512 n.

In addition to generating textual quality scores, theintelligent-text-insight system 106 inputs the textual responses 512a-512 n into a text-analysis engine 518. As above, theintelligent-text-insight system 106 may input the textual responses 512a-512 n into the text-analysis engine 518 to determine any relevancyparameter, topic, or sentiment indicator either before or afterreceiving a query from a client device or generating textual qualityscores. As shown in FIG. 5B, the intelligent-text-insight system 106uses the text-analysis engine to determine the relevancy parameters 520a-520 n, the topics 522 a-522 n, and the sentiment indicators 524 a-524n for the textual responses 512 n-512 n, respectively. The followingparagraphs describe the intelligent-text-insight system 106 determiningeach of these outputs in turn with reference to pertinent descriptionsabove.

To determine the relevancy parameters 520 a-520 n, in some embodiments,the text-analysis engine 518 determines one or both of atopic-similarity score and a recency of a textual response. Uponreceiving a query searching for textual responses corresponding to atopic, for instance, the text-analysis engine 518 determinestopic-similarity scores for textual responses within a response databaseusing a search-engine algorithm. In some such embodiments, thetext-analysis engine 518 determines a Lucene-based search-engine scoreby, for example, using ElasticSearch, Lucene, Solr, or SenseiDB.

Additionally, or alternatively, in certain implementations, thetext-analysis engine 518 determines a recency of a textual response byidentifying a time identifier for each textual response within aresponse database. Such a time identifier indicates a time at which theintelligent-text-insight system 106 received (or a recipient devicesent) the textual response. Upon receiving a query searching for textualresponses corresponding to a time period, for instance, thetext-analysis engine 518 identifies textual responses associated withtime identifiers corresponding to the time period (e.g., within thequeried time period). Accordingly, in some embodiments, the relevancyparameters 520 a-520 n can comprise either (i) a combination oftopic-similarity scores and time identifiers in response to a queriedtopic or (ii) time identifiers in response to a queried time period.

To determine the topics 522 a-522 n, in some embodiments, thetext-analysis engine 518 identifies a topic tag or topic identifierassociated with each textual response from a response database. Suchtopic tags or identifiers may be associated with a particular sentenceor with a textual response as a whole. For example, the text-analysisengine 518 may identify topic tags or topic identifiers for a particularemployee, company, location, organization, model, product, service,survey question, keyword, or other topic. In some embodiments, the topictag or topic identifier may match (or be identified as related to) aqueried topic from a client device. Additionally, in some cases, thetext-analysis engine 518 identifies an inverse-topic tag or aninverse-topic identifier associated with a textual response, where thetag or identifier identifies a topic with which a textual response isnot associated.

To determine the sentiment indicators 524 a-524 n, in some embodiments,the text-analysis engine 518 determines one or both of a sentiment labeland sentiment score for each textual response within a responsedatabase. In doing so, the text-analysis engine 518 may use any of thecomputational sentiment analyses described above. In some cases, such asentiment label and a sentiment score indicate a positive sentiment, anegative sentiment, a mixed sentiment, or a neutral sentiment. Further,a sentiment label or a sentiment score may be specific to a responsesentence within a textual response or applicable to an entire textualresponse.

As part of (or in addition to) determining the sentiment indicators 524a-524 n, the intelligent-text-insight system 106 optionally determines asentiment-polarity indicator for each textual response within a set oftextual responses based on a corresponding sentiment score. Forinstance, the intelligent-text-insight system 106 may determine asentiment-polarity indicator by identifying that a textual responsecorresponds to a sentiment score satisfying a positive-sentiment-scorethreshold (e.g., sentiment scores more than or equal to +8). Conversely,the intelligent-text-insight system 106 may further determine asentiment-polarity indicator by identifying that a textual responsecorresponds to a sentiment score satisfying a negative-sentiment-scorethreshold (e.g., sentiment scores less than or equal to −8).

As further indicated by FIG. 5B, the intelligent-text-insight system 106optionally sorts the textual responses 512 a-512 n to create a sortedset of textual responses 526. For instance, the intelligent-text-insightsystem 106 optionally sorts the textual responses 512 a-512 n based onone or both of the textual quality scores 516 a-516 n and the relevancyparameters 520 a-520 n. As an initial step in sorting, in some cases,the intelligent-text-insight system 106 filters out textual responsescorresponding to relatively low textual quality scores. From among thetextual responses 512 a-512 n, for instance, theintelligent-text-insight system 106 may select textual responsessatisfying a textual-quality-score threshold (e.g., textual responsescorresponding to a textual quality score more than or equal toparticular score).

After filtering by textual quality score, the intelligent-text-insightsystem 106 can sort the textual responses satisfying thetextual-quality-score threshold according to relevancy parameters forthe selected textual responses. By filtering by textual quality scoreand relevancy parameters, in certain implementations, theintelligent-text-insight system 106 creates the sorted set of textualresponses 526 from the textual responses 512 a-512 n.

As further shown in FIG. 5B, independent of whether theintelligent-text-insight system 106 sorts textual responses, theintelligent-text-insight system 106 generates the response groups oftextual responses 528 a-528 n. To generate such response groups, in someembodiments, the intelligent-text-insight system 106 groups textualresponses according to sentiment indicators and topics. For instance,the intelligent-text-insight system 106 can generate the response groupsof textual responses 528 a-528 n based on the sentiment indicators andtopics corresponding to the textual responses within the sorted set oftextual responses 526.

In some embodiments, the response groups of textual responses 528 a-528n correspond to different ranges of sentiment scores. For instance, theresponse group of textual responses 528 a may correspond to a range ofpositive sentiment scores more than or equal to apositive-sentiment-score threshold. Similarly, the response group oftextual responses 528 b may correspond to a range of negative sentimentscores less than or equal to a negative-sentiment-score threshold.Further, the response group of textual responses 528 b may correspond toa particular range of sentiment scores (e.g., a range of sentimentscores indicating a neutral sentiment).

In some such embodiments, the intelligent-text-insight system 106 groupstextual responses from the sorted set of textual responses 526 based inpart on sentiment-polarity indicators corresponding to textualresponses. Accordingly, one or more of the response groups of textualresponses 530 a-530 n may include textual responses corresponding tosentiment scores satisfying (or not satisfying) apositive-sentiment-score threshold or a negative-sentiment-scorethreshold.

As further shown in FIG. 5B, the response group of textual responses 528a includes textual responses 530 a, where each textual responsecorresponds to a relevancy parameter, a topic, and a sentiment indicator(e.g., as identified by meta data). Similarly, the response group oftextual responses 528 b includes textual responses 530 b, where eachtextual response corresponds to a relevancy parameter, a topic, and asentiment indicator. Further, the response group of textual responses528 n includes textual responses 530 n, where each textual responsecorresponds to a relevancy parameter, a topic, and a sentimentindicator. As described below, the intelligent-text-insight system 106may use one or more of the relevancy parameters, topics, or sentimentindicators in each response group to select representative-textualresponses.

After generating response groups, the intelligent-text-insight system106 selects, for display on a client device, a representative-textualresponse from each of the response groups of textual responses 528 a-528n. In particular, the intelligent-text-insight system 106 selectsrepresentative-textual responses 532 a-532 n from the response groups oftextual responses 528 a-528 n, respectively. As shown in FIG. 5B, therepresentative-textual responses 532 a-532 n are each indicative orrepresentative of the textual responses 512 a-512 n.

To select representative-textual responses, in some embodiments, theintelligent-text-insight system 106 selects representative-textualresponses based on relevancy. In some such cases, for instance, theintelligent-text-insight system 106 selects a representative-textualresponse from each of the response groups of textual responses 528 a-528n by identifying a textual response corresponding to a highest relevancyparameter. By selecting a textual response corresponding to a highestrelevancy parameter, the intelligent-text-insight system 106 selects atextual response corresponding to one or both of a highertopic-similarity score and a more current recency from a response groupof textual responses.

Additionally, or alternatively, in some embodiments, theintelligent-text-insight system 106 selects a representative-textualresponse from each of the response groups of textual responses 528 a-528n based on textual quality score and relevancy. For instance, as notedabove, the intelligent-text-insight system 106 optionally determines arelevancy parameter for each textual response by determining atopic-similarity score indicating a relevance of a textual response to aqueried topic and a time identifier indicating a time associated with atextual response (e.g., a time stamp for time of receipt or sending).The intelligent-text-insight system 106 can accordingly select arepresentative-textual response from each response group by identifyinga textual response based on topic-similarity scores and time identifiersamong each response group.

Together with textual quality score, the intelligent-text-insight system106 may select a textual response corresponding to factors in thefollowing priority order: a highest topic-similarity score, a mostrecent time identifier, and a highest textual quality score among eachresponse group. But the intelligent-text-insight system 106 can reorderthe factors in any priority order to select a textual response from aresponse group.

As noted above, in some embodiments, the intelligent-text-insight system106 receives a query from a client device searching for a set of textualresponses and subsequently provides representative-textual responses forthe set of textual responses to the client device for display within agraphical user interface. FIGS. 6A-6B illustrate examples of suchgraphical user interfaces. As suggested above, the administratorapplication 110 optionally includes computer-executable instructionsthat cause the administrator device 108 to present the graphical userinterfaces depicted in FIGS. 6A-6B.

As shown in FIG. 6A, for instance, the administrator device 108 presentsrepresentative-textual responses 620 a-620 c within a graphical userinterface 604 a corresponding to a screen 602. As indicated by variousquery-parameter options within the graphical user interface 604 a, theintelligent-text-insight system 106 receives a query from theadministrator device 108 requesting representative-textual responses orsearching for textual responses responding to a survey question. Inresponse to receiving the query, the intelligent-text-insight system 106provides the representative-textual responses 620 a-620 c for thequeried textual responses to the administrator device 108 for displaywithin the graphical user interface 406 a.

As further shown in FIG. 6A, the intelligent-text-insight system 106provides various query-parameter options within the graphical userinterface 604 a for (i) the administrator 112 to select in constructinga query or (ii) the administrator device 108 to use as defaultparameters in a query. This disclosure names a few such query-parameteroptions. First, the graphical user interface 604 a includes amessage-prompt indicator 606 indicating digital content to which textualresponses respond and for which the intelligent-text-insight system 106identifies textual responses in a query. Second, the graphical userinterface 604 a includes a search field 614 within which theadministrator 112 can enter terms (and the administrator device 108detects terms) to search for parameters for a query. Third, thegraphical user interface 604 a includes a time-period selector 610 a forindicating a time period corresponding to textual responses of interestto the administrator 112. Fourth, the graphical user interface 604 aalso includes a selectable-topic option 612 a that, upon user selectionor by default, indicates a topic for textual responses as a parameterfor the query.

As indicated by the query-parameter options in FIG. 6A, theintelligent-text-insight system 106 receives a query from theadministrator device 108 requesting representative-textual responses(and/or searching) for a set of textual responses defined by the user-or default-selections in the message-prompt indicator 606, thetime-period selector 610 a, and the selectable-topic option 612 a. Basedon receiving the query, the intelligent-text-insight system 106identifies a set of textual responses and generates therepresentative-textual responses 620 a-620 c for the set of textualresponses. As shown in FIG. 6A, the representative-textual responses 620a-620 c include textual responses representative (or indicative) ofresponses to a digital message corresponding to the selected topic andtime period.

As further shown in FIG. 6A, the intelligent-text-insight system 106provides representative-response indicators that correspond to (anddescribe) the representative-textual responses 620 a-620 c within thegraphical user interface 604 a. For instance, the representative-textualresponses 620 a-620 c correspond to a summary caption 616 a indicatingthe selected topic and time period. The representative-textual responses620 a-620 c also respectively correspond to sentiment scores 618 a-618 cshown in the graphical user interface 604 a. Each of the sentimentscores 618 a-618 c in turn correspond to a sentiment label indicated ona sentiment-label key 608. As the sentiment scores 618 a-618 c and thesentiment-label key 608 indicate, in some embodiments, theintelligent-text-insight system 106 selects each of therepresentative-textual responses 620 a-620 c from a correspondingresponse group of textual responses. Such response groups may includetextual responses corresponding to a range of sentiment scores or to aparticular sentiment label (e.g., positive sentiment, neutral sentiment,negative sentiment).

In addition to the sentiment scores 618 a-618 c, theintelligent-text-insight system 106 provides similar-response options622 a-622 c respectively corresponding to the representative-textualresponses 620 a-620 c within the graphical user interface 604 a. Uponselection of a similar-response option, the intelligent-text-insightsystem 106 provides additional textual responses similar to acorresponding representative-textual response. In some embodiments, forinstance, the intelligent-text-insight system 106 receives an indicationof a user selection of one of the similar-response options 622 a-622 cfrom the administrator device 108. In response to receiving anindication of such a user selection, the intelligent-text-insight system106 provides, for display on the administrator device 108, one or moretextual responses similar to a corresponding representative-textualresponse from the representative-textual responses 620 a-620 c.

For example, in response to receiving an indication of a user selectionof the similar-response option 622 a, the intelligent-text-insightsystem 106 provides, for display on the administrator device 108, one ormore textual responses similar to the representative-textual response620 a. To select and display one or more similar textual responses, insome embodiments, the intelligent-text-insight system 106 selectsadditional representative-textual responses from a response group oftextual responses from which the representative-textual response 620 awas selected. Consistent with the disclosure above, theintelligent-text-insight system 106 can select additionalrepresentative-textual responses from a response group of textualresponses based on one or more of textual quality scores or relevancyparameters, including topic-similarity scores and time identifiers.

Turning back now to FIG. 6B, this figure illustrates the administratordevice 108 presenting different representative-textual responsescorresponding to a different topic and a different time period. Inparticular, FIG. 6B depicts the administrator device 108 presentingrepresentative-textual responses 620 d-620 f within a graphical userinterface 604 b corresponding to the screen 602. In this example, theintelligent-text-insight system 106 receives a query from theadministrator device 108 requesting representative-textual responses(and/or searching) for a set of textual responses defined by the user-or default-selections in the message-prompt indicator 606, a time-periodselector 610 b, and a selectable-topic option 612 b. Based on receivingthe query, the intelligent-text-insight system 106 identifies a set oftextual responses and generates the representative-textual responses 620d-620 f for the set of textual responses.

As further shown in FIG. 6B, the intelligent-text-insight system 106provides representative-response indicators that correspond to (anddescribe) the representative-textual responses 620 d-620 f within thegraphical user interface 604 b. For instance, the representative-textualresponses 620 d-620 f correspond to a summary caption 616 b indicating adifferent selected topic and a different time period and to sentimentscores 618 d-618 f. In addition to the sentiment scores 618 d-618 f, theintelligent-text-insight system 106 provides similar-response options622 d-622 f respectively corresponding to the representative-textualresponses 620 d-620 f within the graphical user interface 604 b. Thesimilar-response options 622 d-622 f function similarly to thesimilar-response options 622 a-622 c above, but with respect to therepresentative-textual responses 620 d-620 f.

As suggested above, in some embodiments, the intelligent-text-insightsystem 106 generates a response summary and representative-textualresponses for a same set of textual responses. For instance, in responseto a query searching for a set of textual responses corresponding to asame topic and time period, the intelligent-text-insight system 106optionally generates the response summary 414 a and therepresentative-textual responses 620 a-620 c for display within agraphical user interface. Similarly, in response to a query searchingfor a different set of textual responses corresponding to a differenttopic and a different time period, the intelligent-text-insight system106 optionally generates the response summary 414 b and therepresentative-textual responses 620 d-620 f for display within agraphical user interface.

Turning now to FIGS. 7A-7B, these figures illustrate sequence-flowdiagrams of the intelligent-text-insight system 106 using a distributednetwork architecture of hardware and software to selectrepresentative-textual responses from a set of textual responses inresponse to a query by topic. As shown in FIGS. 7A-7B, a caller 702represents hardware and/or software components that send queries forrepresentative-textual responses. A text engine 704 represents hardwareand/or software components that receive and process queries forrepresentative-textual responses. The document-database manager 706 andthe document-search-database manager 708 respectively represent adocument-oriented-database program, such as MongoDB, and adatabase-management system, such as CrateDB. The analytics engine 710represents a data-analytics engine that retrieves and filters textualresponses.

As shown in FIG. 7A, for example, the caller 702 sends arepresentative-response query 712 to the text engine 704 requestingrepresentative-textual responses for a set of textual responsescorresponding to a topic. The text engine 704 optionally sends atopic-definition request 714 to the document-database manager 706requesting query parameters that define the topic consistent with therepresentative-response query 712. The document-database manager 706 inturn sends a topic-definition 716 defining query parameters for thetopic. Additionally, the text engine 704 optionally adds topic tags tothe query 718 by adding topic tags corresponding (or relevant) to thequery parameters for the topic.

As further shown in FIG. 7A, the text engine 704 sends arelevant-quality-response request 722 to the document-search-databasemanager 708 searching for textual responses that match (or a responsiveto) the queried topic, satisfy a topic-similarity-score threshold, andsatisfy a textual-quality-score threshold. In response to therelevant-quality-response request 722, the document-search-databasemanager 708 sends relevant-quality-responses 724 to the text engine 704.For example, the document-search-database manager 708 sends textualresponses that match (or a responsive to) the queried topic, satisfy thetopic-similarity-score threshold, and satisfy the textual-quality-scorethreshold. In some embodiments, the text engine 704 removes topic untags726 by removing topic a specific type of tag (e.g., inverse topic tags)that identify topics with which a textual response is not associated.

As further indicated by FIG. 7A, the text engine 704 sends asentiment-topic request 728 to the analytics engine 710 requesting thatthe analytics engine 710 identify textual responses from therelevant-quality-responses 724 according to particular sentimentindicators and topic. The sentiment-topic request 728 optionallyincludes a request to filter textual responses by sentiment indicators(e.g., sentiment scores and sentiment-polarity indicators). In responseto the sentiment-topic request 728, the analytics engine 710 sendssorted responses 730 to the text engine 704 comprising textual responsesthat match the particular sentiment indicators and topic and(optionally) have been filtered according to sentiment indicators. Asfurther indicated by a loop 720, the intelligent-text-insight system 106optionally performs the actions within the loop 720 until the textengine 704 receives a sufficient number of textual response satisfyingthe representative-response query 712.

In addition to retrieving the sorted responses 730, in some embodiments,the text engine 704 further stratifies and samples responses 732 byranking and selecting textual responses from response groups of textualresponses for display on a client device. Upon selectingrepresentative-textual responses, the text engine 704 sends therepresentative-textual responses 734 to the caller 702 in response tothe representative-response query 712. The representative-textualresponses 734 are representative or indicative of a set of textualresponses indicated by the representative-response query 712.

As noted above, FIG. 7A depicts an example network architecture in aquery by topic. By contrast, in a query by time period without a topic,the intelligent-text-insight system performs the same actions shown inFIG. 7A, but without the topic-definition request 714 or thetopic-definition 716 from the document-database manager 706.

FIG. 7B depicts an alternative network architecture used by theintelligent-text-insight system 106 to select representative-textualresponses from a set of textual responses in response to a query bytopic. As indicated by FIG. 7B, the intelligent-text-insight system 106uses data models comprising a number of textual responses that theintelligent-text-insight system 106 identifies as responsive to digitalcontent (e.g., a particular survey question) before queries. In someembodiments, the intelligent-text-insight system 106 identifies textualresponses representative of other textual responses—that is,representative-textual responses—and includes those within the responsegroups. As part of using such data models, the intelligent-text-insightsystem 106 groups the identified textual responses into response groupsconsistent with the disclosure above.

In some such embodiments, the intelligent-text-insight system 106frequently or periodically updates data models (or updates such modelsafter each query) by performing actions 714-732 in FIG. 7A—but inresponse to an update-model request from the caller 702. By creatingsuch preformed response groups of textual responses before receiving aquery and updating data models, the intelligent-text-insight system 106expedites retrieval of representative-textual responses for a query.

As shown in FIG. 7B, for example, the caller 702 sends arepresentative-response query 736 to the text engine 704 requestingrepresentative-textual responses for a set of textual responsescorresponding to a topic. The text engine 704 subsequently sends atopic-definition request 738 to the document-database manager 706requesting query parameters that define the topic consistent with therepresentative-response query 736. The document-database manager 706 inturn sends a topic-definition 740 defining query parameters for thetopic.

As further shown in FIG. 7B, the text engine 704 sends a data-modelrequest 742 to the document-database manager 706 requesting textualresponse from preformed response groups of textual responsescorresponding to the query parameters defining the topic. In response tothe data-model request 742, the document-database manager 706 sends datamodels 744 comprising textual responses from the preformed responsegroups. In some embodiments, the text engine 704 further removes topicuntags 746 by removing topic specific tags (e.g., inverse topic tags)that identify topics with which textual responses are not associated.

In addition to retrieving the data models 744, the text engine 704further stratifies and samples responses 748 by ranking and selectingtextual responses from the data models 744 for display on a clientdevice. Upon selecting representative-textual responses, the text engine704 sends the representative-textual responses 750 to the caller 702 inresponse to the representative-response query 736. Therepresentative-textual responses 750 are representative or indicative ofa set of textual responses indicated by the representative-responsequery 736.

Turning now to FIG. 8, this figure illustrates a flowchart of a seriesof acts 800 of applying a response-extraction-neural network andcomputational sentiment analysis to generate a response summary for aset of textual responses in accordance with one or more embodiments.While FIG. 8 illustrates acts according to one embodiment, alternativeembodiments may omit, add to, reorder, and/or modify any of the actsshown in FIG. 8. The acts of FIG. 7 can be performed as part of amethod. Alternatively, a non-transitory computer readable storage mediumcan comprise instructions that, when executed by one or more processors,cause a computing device to perform the acts depicted in FIG. 8. Instill further embodiments, a system can perform the acts of FIG. 8.

As shown in FIG. 8, the acts 800 include an act 810 of providing a setof textual responses to a response-extraction-neural network to generatea set of extracted sentences. In particular, in some embodiments, theact 810 includes providing a set of textual responses to aresponse-extraction-neural network to generate a set of extractedsentences, wherein each extracted sentence corresponds to a selectedsentence from the set of textual responses.

For instance, in certain implementations, generating the set ofextracted sentences comprises: extracting selected sentences from theset of textual responses utilizing an extractor-selection network of theresponse-extraction-neural network; and paraphrasing one or moresentences from the selected sentences utilizing an abstractor network ofthe response-extraction-neural network.

As further shown in FIG. 8, the acts 800 include an act 820 ofdetermining a sentiment indicator for each extracted sentence. Inparticular, in some embodiments, the act 820 includes determining asentiment indicator for each extracted sentence from the set ofextracted sentences to sort the set of extracted sentences. Forinstance, in some cases, determining the sentiment indicator for eachextracted sentence comprises determining, for each extracted sentence,one or both of a sentiment score and a sentiment label indicating apositive sentiment or a negative sentiment.

As further shown in FIG. 8, the acts 800 include an act 830 ofgenerating a first cluster of extracted sentences and a second clusterof extracted sentences. In particular, in some embodiments, the act 830includes, based on a sorted set of extracted sentences, generating afirst cluster of extracted sentences corresponding to a first contextand a second cluster of extracted sentences corresponding to a secondcontext.

As further shown in FIG. 8, the acts 800 include an act 840 ofgenerating a response summary based on a first extracted sentence fromthe first cluster and a second extracted sentence from the secondcluster. In particular, in certain implementations, the act 830 includesgenerating a response summary for the set of textual responses based ona first extracted sentence from the first cluster of extracted sentencesand a second extracted sentence from the second cluster of extractedsentences.

For instance, in certain implementations, generating the responsesummary for the set of textual responses based on the first extractedsentence and the second extracted sentence comprises generating theresponse summary for the set of textual responses comprising both thefirst extracted sentence and the second extracted sentence. In some suchembodiments, generating the response summary for the set of textualresponses based on the first extracted sentence and the second extractedsentence comprises generating the response summary for the set oftextual responses comprising a portion of the first extracted sentenceand a portion of the second extracted sentence.

In one or more embodiments, selecting the first extracted sentence andthe second extracted sentence for the response summary comprises:selecting the first extracted sentence from the first cluster ofextracted sentences based on a relevance of the first extracted sentenceto a first linguistic context embedding; and selecting the secondextracted sentence from the second cluster of extracted sentences basedon a relevance of the second extracted sentence to a second linguisticcontext embedding. In particular, in some cases, selecting the firstextracted sentence and the second extracted sentence for the responsesummary comprises: selecting the first extracted sentence from the firstcluster of extracted sentences based on a maximum marginal relevance ofthe first extracted sentence to the first linguistic context embedding;and selecting the second extracted sentence from the second cluster ofextracted sentences based on a maximum marginal relevance of the secondextracted sentence to the second linguistic context embedding.

By contrast, in certain implementations, selecting the first extractedsentence and the second extracted sentence for the response summarycomprises: selecting the first extracted sentence and a third extractedsentence from the first cluster of extracted sentences based on arelevance of each of the first extracted sentence and the thirdextracted sentence to a first linguistic context embedding; selectingthe second extracted sentence and a fourth extracted sentence from thesecond cluster of extracted sentences based on a relevance of each ofthe second extracted sentence and the fourth extracted sentence to asecond linguistic context embedding; providing the first, second, third,and fourth extracted sentences to a summarization neural network togenerate: a first compressed sentence representing a summary of thefirst extracted sentence and the third extracted sentence: and a secondcompressed sentence representing a summary of the second extractedsentence and the fourth extracted sentence; and generating the responsesummary for the set of textual responses comprising the first compressedsentence and the second compressed sentence.

In addition to the acts 810-840, the acts 800 may include additions orvariations. In certain implementations, for instance, the acts 800include providing the set of textual responses to theresponse-extraction-neural network by: receiving, from a client device,a user query searching for textual responses corresponding to a timeperiod and responding to a survey question; and selecting the set oftextual responses based on a time identifier for each textual responsecorresponding to the time period indicated by the user query; anddetermining the sentiment indicator for each extracted sentence from theset of extracted sentences by determining, for each extracted sentence,a sentiment score indicating a positive sentiment or a negativesentiment. In some cases, the acts 800 further include providing theresponse summary for the set of textual responses to the client devicefor display within a graphical user interface.

By contrast, in one or more embodiments, the acts 800 include providingthe set of textual responses to the response-extraction-neural networkby: receiving, from a client device, a user query searching for textualresponses corresponding to a topic and responding to a survey question;and selecting the set of textual responses based on a relevance of eachtextual response from the set of textual responses to the user query;and determining the sentiment indicator for each extracted sentence fromthe set of extracted sentences by determining, for each extractedsentence, a sentiment score indicating a positive sentimentcorresponding to the topic or a negative sentiment corresponding to thetopic. In some cases, the acts 800 further include providing theresponse summary for the set of textual responses to the client devicefor display within a graphical user interface.

Further, in certain implementations, the acts 800 include determiningthe sentiment indicator for each extracted sentence by determining asentiment score indicating a positive sentiment or a negative sentimentfor each extracted sentence; and generating the first cluster ofextracted sentences corresponding to the first context and the secondcluster of extracted sentences corresponding to the second context by:generating the first cluster of extracted sentences corresponding topositive sentiment scores and to a first linguistic context embedding;and generating the second cluster of extracted sentences correspondingto negative sentiment scores and to a second linguistic contextembedding.

In addition to the acts 810-840, in some embodiments, the acts 800further include generating the response summary for the set of textualresponses in response to a query for textual responses corresponding toa time period and responding to a survey question; generating anadditional response summary for an additional set of textual responsesin response to an additional query for additional textual responsescorresponding to an additional time period and responding to the surveyquestion; and providing, for display on a client device, a summarycomparison of the response summary for the set of textual responses andthe additional response summary for the additional set of textualresponses.

Additionally, in certain implementations, the acts 800 further includegenerating the sorted set of extracted sentences comprises: selecting,from the set of textual responses, a first subset of extracted sentencescorresponding to sentiment scores satisfying a positive-sentiment-scorethreshold for positive sentiments; and selecting, from the set oftextual responses, a second subset of extracted sentences correspondingto sentiment scores satisfying a negative-sentiment-score threshold fornegative sentiments.

In particular, in some cases, generating the sorted set of extractedsentences comprises: grouping the set of extracted sentences into apositive segment comprising extracted sentences corresponding tosentiment scores indicating positive sentiments and a negative segmentcomprising extracted sentences corresponding to sentiment scoresindicating negative sentiments; selecting, from the positive segment, afirst subset of extracted sentences satisfying apositive-sentiment-score threshold; and selecting, from the negativesegment, a second subset of extracted sentences satisfying anegative-sentiment-score threshold.

Further, in one or more embodiments, the acts 800 include matching eachextracted sentence from the first subset of extracted sentences to afirst linguistic-context embedding and each extracted sentence from thesecond subset of extracted sentences to a second linguistic-contextembedding.

In particular, in certain implementations, generating the first clusterof extracted sentences corresponding to the first context and the secondcluster of extracted sentences corresponding to the second contextcomprises: applying a smooth-inverse-frequency algorithm to the firstsubset of extracted sentences to generate a first linguistic contextembedding and a third linguistic context embedding; generating the firstcluster of extracted sentences by identifying, from among the firstsubset of extracted sentences, extracted sentences corresponding to thefirst linguistic context embedding; applying thesmooth-inverse-frequency algorithm to the second subset of extractedsentences to generate a second linguistic context embedding; andgenerating the second cluster of extracted sentences by identifying,from among the second subset of extracted sentences, extracted sentencescorresponding to the second linguistic context embedding.

As noted above, in some embodiments, the intelligent-text-insight system106 trains a response-extraction-neural network. Accordingly, in somecases, the acts 800 include training an extractor-selection network ofthe response-extraction-neural network by: providing a textual-trainingresponse to the extractor-selection network of theresponse-extraction-neural network to extract a selected-trainingsentence from the textual-training response; determining a first lossfrom a first loss function based on a comparison of theselected-training sentence and a similar-response sentence correspondingto a ground-truth-paraphrased sentence; and adjusting network parametersof the extractor-selection network based on the determined first loss.

Relatedly, in some cases, the acts 800 include training an abstractornetwork of the response-extraction-neural network by: providing theselected-training sentence from the textual-training response to theabstractor network of the response-extraction-neural network to generatea paraphrased-training sentence; determining a second loss from a secondloss function based on a comparison of the paraphrased-training sentenceand the ground-truth-paraphrased sentence; and adjusting networkparameters of the abstractor network based on the determined secondloss.

Turning now to FIG. 9, this figure illustrates a flowchart of a seriesof acts 900 of applying a textual-quality classifier and computationalsentiment analysis to select representative-textual responses from a setof textual responses in accordance with one or more embodiments. WhileFIG. 9 illustrates acts according to one embodiment, alternativeembodiments may omit, add to, reorder, and/or modify any of the actsshown in FIG. 9. The acts of FIG. 9 can be performed as part of amethod. Alternatively, a non-transitory computer readable storage mediumcan comprise instructions that, when executed by one or more processors,cause a computing device to perform the acts depicted in FIG. 9. Instill further embodiments, a system can perform the acts of FIG. 9.

As shown in FIG. 9, the acts 900 include an act 910 of providing a setof textual responses to a text-quality classifier to generate a textualquality score for each textual response. In particular, in someembodiments, the act 910 includes providing a set of textual responsesto a text-quality classifier to generate a textual quality score foreach textual response from the set of textual responses.

For example, in some cases, providing the set of textual responses tothe text-quality classifier to generate the textual quality score foreach textual response comprises: inputting the set of textual responsesinto a random-forest classifier; and utilizing the random-forestclassifier to generate the textual quality score for each textualresponse based on textual feature corresponding to each textualresponse.

As further shown in FIG. 9, the acts 900 include an act 920 ofdetermining a relevancy parameter, a sentiment indicator, and a topicfor each textual response. In particular, in some embodiments, the act820 includes determining a relevancy parameter, a sentiment indicator,and a topic for each textual response from the set of textual responses,wherein a given relevancy parameter indicates a relevance of a giventextual response to a user query, a given sentiment indicator indicatesa linguistic sentiment of the given textual response, and a given topicindicates a subject matter of the given textual response.

As further shown in FIG. 9, the acts 900 include an act 930 ofgenerating a first response group of textual responses and a secondresponse group of textual responses from the set of textual responses.In particular, in some embodiments, the act 930 includes generating afirst response group of textual responses and a second response group oftextual responses from the set of textual responses based on thesentiment indicator and the topic for each textual response from the setof textual responses.

As further shown in FIG. 9, the acts 900 include an act 940 of selectinga first representative-textual response from the first response groupand a second representative-textual response from the second responsegroup. In particular, in some embodiments, the act 940 includesselecting, for display on a client device, a firstrepresentative-textual response from the first response group and asecond representative-textual response from the second response groupbased on the relevancy parameter for each textual response within thefirst response group and the relevancy parameter for each textualresponse within the second response group.

In addition to the acts 910-940, the acts 900 may include additions orvariations. For instance, in some embodiments, the acts 900 includeselecting the set of textual responses by: receiving, from a clientdevice, a user query searching for textual responses corresponding to atime period and responding to a survey question; and selecting the setof textual responses based on a time identifier for each textualresponse corresponding to the time period indicated by the user query.

By contrast, in certain implementations, the acts 900 include selectingthe set of textual responses by: receiving, from a client device, a userquery searching for textual responses corresponding to a topic andresponding to a survey question; and selecting the set of textualresponses based on a relevance of each textual response from the set oftextual responses to the user query.

As further suggested above, in certain implementations, the acts 900include sorting the set of textual responses by: selecting, from aresponse database, textual responses satisfying a textual-quality-scorethreshold; and sorting the selected textual responses based on therelevancy parameter for each textual response from the selected textualresponses to create a sorted set of textual responses.

Further, in some cases, the acts 900 further include generating, fromthe sorted set of textual responses, the first response group of textualresponses corresponding to a first range of sentiment indicators and thesecond response group of textual responses corresponding to a secondrange of sentiment indicators based on the sentiment indicator and thetopic for each textual response from the sorted set of textualresponses.

As suggested above, in some embodiments, the acts 900 includedetermining the sentiment indicator for each textual response bydetermining, for each textual response, a sentiment score indicating apositive sentiment or a negative sentiment; determining, for eachtextual response, a sentiment-polarity indicator based on the sentimentscore for each textual response satisfying or not satisfying apositive-sentiment-score threshold or a negative-sentiment-scorethreshold; and generating the first response group of textual responsesand the second response group of textual responses based on thesentiment score, the sentiment-polarity indicator, and the topic foreach textual response from the set of textual responses.

Further, in certain implementations, the acts 900 include determiningthe relevancy parameter for each textual response from the set oftextual responses by determining, for each textual response, atopic-similarity score indicating a relevance of the textual response toa queried topic and a time identifier indicating a time associated withthe textual response; selecting, for display on the client device, thefirst representative-textual response from the first response groupbased on the textual quality score, the topic-similarity score, and thetime identifier for each textual response within the first responsegroup; and selecting, for display on the client device, the secondrepresentative-textual response from the second response group based onthe textual quality score, the topic-similarity score, and the timeidentifier for each textual response within the second response group.

As suggested above, in some embodiments, the intelligent-text-insightsystem 106 provides similar-response options. For example, in somecases, the acts 900 include providing, for display on the client device,a similar-response option corresponding to the firstrepresentative-textual response to view additional textual responsesfrom the set of textual responses similar to the firstrepresentative-textual response; receiving, from the client device, anindication of a user selection of the similar-response option; and inresponse to receiving the indication of the user selection, provide, fordisplay on the client device, a third textual response from the firstresponse group based on the relevancy parameter for each textualresponse within the first response group.

In addition (or in the alternative) to applying the text-qualityclassifier, in some embodiments, the intelligent-text-insight system 106trains a text-quality classifier. For instance, in certainimplementations, the acts 900 include training the text-qualityclassifier by: identifying textual feature dimensions for thetext-quality classifier to analyze upon receipt of textual responses;providing a textual-training response to the text-quality classifier togenerate a textual-quality-training score for the textual-trainingresponse; determining a loss from a loss function based on a comparisonof the textual-quality-training score and a ground-truth-quality score;and adjusting network parameters of the text-quality classifier based onthe determined loss.

Embodiments of the present disclosure may comprise or utilize aspecial-purpose or general-purpose computer including computer hardware,such as, for example, one or more processors and system memory, asdiscussed in greater detail below. Embodiments within the scope of thepresent disclosure also include physical and other computer-readablemedia for carrying or storing computer-executable instructions and/ordata structures. In particular, one or more of the processes describedherein may be implemented at least in part as instructions embodied in anon-transitory computer-readable medium and executable by one or morecomputing devices (e.g., any of the media content access devicesdescribed herein). In general, a processor (e.g., a microprocessor)receives instructions, from a non-transitory computer-readable medium,(e.g., a memory, etc.), and executes those instructions, therebyperforming one or more processes, including one or more of the processesdescribed herein.

Computer-readable media can be any available media that can be accessedby a general purpose or special purpose computer system.Computer-readable media that store computer-executable instructions arenon-transitory computer-readable storage media (devices).Computer-readable media that carry computer-executable instructions aretransmission media. Thus, by way of example, and not limitation,embodiments of the disclosure can comprise at least two distinctlydifferent kinds of computer-readable media: non-transitorycomputer-readable storage media (devices) and transmission media.

Non-transitory computer-readable storage media (devices) includes RAM,ROM, EEPROM, CD-ROM, solid state drives (“SSDs”) (e.g., based on RAM),Flash memory, phase-change memory (“PCM”), other types of memory, otheroptical disk storage, magnetic disk storage or other magnetic storagedevices, or any other medium which can be used to store desired programcode means in the form of computer-executable instructions or datastructures and which can be accessed by a general purpose or specialpurpose computer.

A “network” is defined as one or more data links that enable thetransport of electronic data between computer systems and/or modulesand/or other electronic devices. When information is transferred, orprovided over a network or another communications connection (eitherhardwired, wireless, or a combination of hardwired or wireless) to acomputer, the computer properly views the connection as a transmissionmedium. Transmissions media can include a network and/or data linkswhich can be used to carry desired program code means in the form ofcomputer-executable instructions or data structures and which can beaccessed by a general purpose or special purpose computer. Combinationsof the above should also be included within the scope ofcomputer-readable media.

Further, upon reaching various computer system components, program codemeans in the form of computer-executable instructions or data structurescan be transferred automatically from transmission media tonon-transitory computer-readable storage media (devices) (or viceversa). For example, computer-executable instructions or data structuresreceived over a network or data link can be buffered in RAM within anetwork interface module (e.g., a “NIC”), and then eventuallytransferred to computer system RAM and/or to less volatile computerstorage media (devices) at a computer system. Thus, it should beunderstood that non-transitory computer-readable storage media (devices)can be included in computer system components that also (or evenprimarily) utilize transmission media.

Computer-executable instructions comprise, for example, instructions anddata which, when executed at a processor, cause a general-purposecomputer, special purpose computer, or special purpose processing deviceto perform a certain function or group of functions. In one or moreembodiments, computer-executable instructions are executed on ageneral-purpose computer to turn the general-purpose computer into aspecial purpose computer implementing elements of the disclosure. Thecomputer executable instructions may be, for example, binaries,intermediate format instructions such as assembly language, or evensource code. Although the subject matter has been described in languagespecific to structural marketing features and/or methodological acts, itis to be understood that the subject matter defined in the appendedclaims is not necessarily limited to the described marketing features oracts described above. Rather, the described marketing features and actsare disclosed as example forms of implementing the claims.

Those skilled in the art will appreciate that the disclosure may bepracticed in network computing environments with many types of computersystem configurations, including, personal computers, desktop computers,laptop computers, message processors, hand-held devices, multi-processorsystems, microprocessor-based or programmable consumer electronics,network PCs, minicomputers, mainframe computers, mobile telephones,PDAs, tablets, pagers, routers, switches, and the like. The disclosuremay also be practiced in distributed system environments where local andremote computer systems, which are linked (either by hardwired datalinks, wireless data links, or by a combination of hardwired andwireless data links) through a network, both perform tasks. In adistributed system environment, program modules may be located in bothlocal and remote memory storage devices.

Embodiments of the present disclosure can also be implemented in cloudcomputing environments. In this description, “cloud computing” isdefined as a subscription model for enabling on-demand network access toa shared pool of configurable computing resources. For example, cloudcomputing can be employed in the marketplace to offer ubiquitous andconvenient on-demand access to the shared pool of configurable computingresources. The shared pool of configurable computing resources can berapidly provisioned via virtualization and released with low managementeffort or service provider interaction, and then scaled accordingly.

A cloud-computing subscription model can be composed of variouscharacteristics such as, for example, on-demand self-service, broadnetwork access, resource pooling, rapid elasticity, measured service,and so forth. A cloud-computing subscription model can also exposevarious service subscription models, such as, for example, Software as aService (“SaaS”), a web service, Platform as a Service (“PaaS”), andInfrastructure as a Service (“IaaS”). A cloud-computing subscriptionmodel can also be deployed using different deployment subscriptionmodels such as private cloud, community cloud, public cloud, hybridcloud, and so forth. In this description and in the claims, a“cloud-computing environment” is an environment in which cloud computingis employed.

FIG. 10 illustrates a block diagram of an exemplary computing device1000 that may be configured to perform one or more of the processesdescribed above. One will appreciate that one or more computing devicessuch as the computing device 1000 may implement the administrator device109, the recipient devices 114 a-114 n, the server device(s) 102, and/orother devices described above in connection with FIG. 1. As shown byFIG. 10, the computing device 1000 can comprise a processor 1002, amemory 1004, a storage device 1006, an I/O interface 1008, and acommunication interface 1010, which may be communicatively coupled byway of a communication infrastructure 1012. While the exemplarycomputing device 1000 is shown in FIG. 10, the components illustrated inFIG. 10 are not intended to be limiting. Additional or alternativecomponents may be used in other embodiments. Furthermore, in certainembodiments, the computing device 1000 can include fewer components thanthose shown in FIG. 10. Components of the computing device 1000 shown inFIG. 10 will now be described in additional detail.

In one or more embodiments, the processor 1002 includes hardware forexecuting instructions, such as those making up a computer program. Asan example and not by way of limitation, to execute instructions, theprocessor 1002 may retrieve (or fetch) the instructions from an internalregister, an internal cache, the memory 1004, or the storage device 1006and decode and execute them. In one or more embodiments, the processor1002 may include one or more internal caches for data, instructions, oraddresses. As an example and not by way of limitation, the processor1002 may include one or more instruction caches, one or more datacaches, and one or more translation lookaside buffers (“TLBs”).Instructions in the instruction caches may be copies of instructions inthe memory 1004 or the storage device 1006.

The memory 1004 may be used for storing data, metadata, and programs forexecution by the processor(s). The memory 1004 may include one or moreof volatile and non-volatile memories, such as Random-Access Memory(“RAM”), Read Only Memory (“ROM”), a solid-state disk (“SSD”), Flash,Phase Change Memory (“PCM”), or other types of data storage. The memory1004 may be internal or distributed memory.

The storage device 1006 includes storage for storing data orinstructions. As an example and not by way of limitation, storage device1006 can comprise a non-transitory storage medium described above. Thestorage device 1006 may include a hard disk drive (“HDD”), a floppy diskdrive, flash memory, an optical disc, a magneto-optical disc, magnetictape, or a Universal Serial Bus (“USB”) drive or a combination of two ormore of these. The storage device 1006 may include removable ornon-removable (or fixed) media, where appropriate. The storage device1006 may be internal or external to the computing device 1000. In one ormore embodiments, the storage device 1006 is non-volatile, solid-statememory. In other embodiments, the storage device 1006 includes read-onlymemory (“ROM”). Where appropriate, this ROM may be mask programmed ROM,programmable ROM (“PROM”), erasable PROM (“EPROM”), electricallyerasable PROM (“EEPROM”), electrically alterable ROM (“EAROM”), or flashmemory or a combination of two or more of these.

The I/O interface 1008 allows a user to provide input to, receive outputfrom, and otherwise transfer data to and receive data from the computingdevice 1000. The I/O interface 1008 may include a mouse, a keypad or akeyboard, a touch screen, a camera, an optical scanner, networkinterface, modem, other known I/O devices or a combination of such I/Ointerfaces. The I/O interface 1008 may include one or more devices forpresenting output to a user, including, but not limited to, a graphicsengine, a display (e.g., a display screen), one or more output drivers(e.g., display drivers), one or more audio speakers, and one or moreaudio drivers. In certain embodiments, the I/O interface 1008 isconfigured to provide graphical data to a display for presentation to auser. The graphical data may be representative of one or more graphicaluser interfaces and/or any other graphical content as may serve aparticular implementation.

The communication interface 1010 can include hardware, software, orboth. In any event, the communication interface 1010 can provide one ormore interfaces for communication (such as, for example, packet-basedcommunication) between the computing device 1000 and one or more othercomputing devices or networks. As an example and not by way oflimitation, the communication interface 1010 may include a networkinterface controller (“NIC”) or network adapter for communicating withan Ethernet or other wire-based network or a wireless NIC (“WNIC”) orwireless adapter for communicating with a wireless network, such as aWI-FI.

Additionally, or alternatively, the communication interface 1010 mayfacilitate communications with an ad hoc network, a personal areanetwork (“PAN”), a local area network (“LAN”), a wide area network(“WAN”), a metropolitan area network (“MAN”), or one or more portions ofthe Internet or a combination of two or more of these. One or moreportions of one or more of these networks may be wired or wireless. Asan example, the communication interface 1010 may facilitatecommunications with a wireless PAN (“WPAN”) (such as, for example, aBLUETOOTH WPAN), a WI-FI network, a WI-MAX network, a cellular telephonenetwork (such as, for example, a Global System for Mobile Communications(“GSM”) network), or other suitable wireless network or a combinationthereof.

Additionally, the communication interface 1010 may facilitatecommunications various communication protocols. Examples ofcommunication protocols that may be used include, but are not limitedto, data transmission media, communications devices, TransmissionControl Protocol (“TCP”), Internet Protocol (“IP”), File TransferProtocol (“FTP”), Telnet, Hypertext Transfer Protocol (“HTTP”),Hypertext Transfer Protocol Secure (“HTTPS”), Session InitiationProtocol (“SIP”), Simple Object Access Protocol (“SOAP”), ExtensibleMark-up Language (“XML”) and variations thereof, Simple Mail TransferProtocol (“SMTP”), Real-Time Transport Protocol (“RTP”), User DatagramProtocol (“UDP”), Global System for Mobile Communications (“GSM”)technologies, Code Division Multiple Access (“CDMA”) technologies, TimeDivision Multiple Access (“TDMA”) technologies, Short Message Service(“SMS”), Multimedia Message Service (“MMS”), radio frequency (“RF”)signaling technologies, Long Term Evolution (“LTE”) technologies,wireless communication technologies, in-band and out-of-band signalingtechnologies, and other suitable communications networks andtechnologies.

The communication infrastructure 1012 may include hardware, software, orboth that couples components of the computing device 1000 to each other.As an example and not by way of limitation, the communicationinfrastructure 1012 may include an Accelerated Graphics Port (“AGP”) orother graphics bus, an Enhanced Industry Standard Architecture (“EISA”)bus, a front-side bus (“FSB”), a HYPERTRANSPORT (“HT”) interconnect, anIndustry Standard Architecture (“ISA”) bus, an INFINIBAND interconnect,a low-pin-count (“LPC”) bus, a memory bus, a Micro Channel Architecture(“MCA”) bus, a Peripheral Component Interconnect (“PCI”) bus, aPCI-Express (“PCIe”) bus, a serial advanced technology attachment(“SATA”) bus, a Video Electronics Standards Association local (“VLB”)bus, or another suitable bus or a combination thereof.

FIG. 11 illustrates an example network environment 1100 of the digitalsurvey system 104. Network environment 1100 includes a client device1106, and a server device 1102 connected to each other by a network1104. Although FIG. 11 illustrates a particular arrangement of clientdevice 1106, server device 1102, and network 1104, this disclosurecontemplates any suitable arrangement of client device 1106, serverdevice 1102, and network 1104. As an example and not by way oflimitation, two or more of the client devices 1106, and server devices1102 may be connected to each other directly, bypassing network 1104. Asanother example, two or more of client devices 1106 and server devices1102 may be physically or logically co-located with each other in whole,or in part. Moreover, although FIG. 11 illustrates a particular numberof client devices 1106, server devices 1102, and networks 1104, thisdisclosure contemplates any suitable number of client devices 1106,server devices 1102, and networks 1104. As an example and not by way oflimitation, network environment 1100 may include multiple client devices1106, server devices 1102, and networks 1104.

This disclosure contemplates any suitable network 1104. As an exampleand not by way of limitation, one or more portions of network 1104 mayinclude an ad hoc network, an intranet, an extranet, a virtual privatenetwork (“VPN”), a local area network (“LAN”), a wireless LAN (“WLAN”),a wide area network (“WAN”), a wireless WAN (“WWAN”), a metropolitanarea network (“MAN”), a portion of the Internet, a portion of the PublicSwitched Telephone Network (“PSTN”), a cellular telephone network, or acombination of two or more of these. Network 1104 may include one ormore networks 1104.

Links may connect client device 1106, and server device 1102 to network1104 or to each other. This disclosure contemplates any suitable links.In particular embodiments, one or more links include one or morewireline (such as for example Digital Subscriber Line (“DSL”) or DataOver Cable Service Interface Specification (“DOCSIS”)), wireless (suchas for example Wi-Fi or Worldwide Interoperability for Microwave Access(“WiMAX”)), or optical (such as for example Synchronous Optical Network(SONET) or Synchronous Digital Hierarchy (“SDH”)) links. In particularembodiments, one or more links each include an ad hoc network, anintranet, an extranet, a VPN, a LAN, a WLAN, a WAN, a WWAN, a MAN, aportion of the Internet, a portion of the PSTN, a cellulartechnology-based network, a satellite communications technology-basednetwork, another link, or a combination of two or more such links. Linksneed not necessarily be the same throughout network environment 1100.One or more first links may differ in one or more respects from one ormore second links.

In particular embodiments, client device 1106 may be an electronicdevice including hardware, software, or embedded logic components or acombination of two or more such components and capable of carrying outthe appropriate functionalities implemented or supported by clientdevice 1106. As an example and not by way of limitation, a client device1106 may include any of the computing devices discussed above inrelation to FIG. 10. A client device 1106 may enable a network user atclient device 1106 to access network 1104.

In particular embodiments, client device 1106 may include a web browser,such as MICROSOFT INTERNET EXPLORER, GOOGLE CHROME, or MOZILLA FIREFOX,and may have one or more add-ons, plug-ins, or other extensions, such asTOOLBAR or YAHOO TOOLBAR. A user at client device 1106 may enter aUniform Resource Locator (“URL”) or other address directing the webbrowser to a particular server (such as server, or a server associatedwith a third-party system), and the web browser may generate a HyperText Transfer Protocol (“HTTP”) request and communicate the HTTP requestto server. The server may accept the HTTP request and communicate toclient device 1106 one or more Hyper Text Markup Language (“HTML”) filesresponsive to the HTTP request. Client device 1106 may render a webpagebased on the HTML files from the server for presentation to the user.This disclosure contemplates any suitable webpage files. As an exampleand not by way of limitation, webpages may render from HTML files,Extensible Hyper Text Markup Language (“XHTML”) files, or ExtensibleMarkup Language (“XML”) files, according to particular needs. Such pagesmay also execute scripts such as, for example and without limitation,those written in JAVASCRIPT, JAVA, MICROSOFT SILVERLIGHT, combinationsof markup language and scripts such as AJAX (Asynchronous JAVASCRIPT andXML), and the like. Herein, reference to a webpage encompasses one ormore corresponding webpage files (which a browser may use to render thewebpage) and vice versa, where appropriate.

In particular embodiments, server device 1102 may include a variety ofservers, sub-systems, programs, modules, logs, and data stores. Inparticular embodiments, server device 1102 may include one or more ofthe following: a web server, action logger, API-request server,relevance-and-ranking engine, content-object classifier, notificationcontroller, action log, third-party-content-object-exposure log,inference module, authorization/privacy server, search module,advertisement-targeting module, user-interface module, user-profilestore, connection store, third-party content store, or location store.Server device 1102 may also include suitable components such as networkinterfaces, security mechanisms, load balancers, failover servers,management-and-network-operations consoles, other suitable components,or any suitable combination thereof.

In particular embodiments, server device 1102 may include one or moreuser-profile stores for storing user profiles. A user profile mayinclude, for example, biographic information, demographic information,behavioral information, social information, or other types ofdescriptive information, such as work experience, educational history,hobbies or preferences, interests, affinities, or location. Interestinformation may include interests related to one or more categories.Categories may be general or specific. Additionally, a user profile mayinclude financial and billing information of users (e.g., users 116 aand 116 n, customers, etc.).

The foregoing specification is described with reference to specificexemplary embodiments thereof. Various embodiments and aspects of thedisclosure are described with reference to details discussed herein, andthe accompanying drawings illustrate the various embodiments. Thedescription above and drawings are illustrative and are not to beconstrued as limiting. Numerous specific details are described toprovide a thorough understanding of various embodiments.

The additional or alternative embodiments may be embodied in otherspecific forms without departing from its spirit or essentialcharacteristics. The described embodiments are to be considered in allrespects only as illustrative and not restrictive. The scope of theinvention is, therefore, indicated by the appended claims rather than bythe foregoing description. All changes that come within the meaning andrange of equivalency of the claims are to be embraced within theirscope.

We claim:
 1. A system comprising: at least one processor; at least onenon-transitory computer readable storage medium storing instructionsthat, when executed by the at least one processor, cause the system to:provide a set of textual responses to a response-extraction-neuralnetwork to generate a set of extracted sentences, wherein each extractedsentence corresponds to a selected sentence from the set of textualresponses; determine a sentiment indicator for each extracted sentencefrom the set of extracted sentences; sort the set of extracted sentencesaccording to the sentiment indicator for each extracted sentence togenerate a sorted set of extracted sentences; based on the sorted set ofextracted sentences, generate a first cluster of extracted sentencescorresponding to a first context and a second cluster of extractedsentences corresponding to a second context; and generate a responsesummary for the set of textual responses based on a first extractedsentence from the first cluster of extracted sentences and a secondextracted sentence from the second cluster of extracted sentences. 2.The system of claim 1, further comprising instructions that, whenexecuted by the at least one processor, cause the system to: provide theset of textual responses to the response-extraction-neural network by:receiving, from a client device, a user query searching for textualresponses corresponding to a time period and responding to a surveyquestion; and selecting the set of textual responses based on a timeidentifier for each textual response corresponding to the time periodindicated by the user query; and determine the sentiment indicator foreach extracted sentence from the set of extracted sentences bydetermining, for each extracted sentence, a sentiment score indicating apositive sentiment or a negative sentiment.
 3. The system of claim 1,further comprising instructions that, when executed by the at least oneprocessor, cause the system to: provide the set of textual responses tothe response-extraction-neural network by: receiving, from a clientdevice, a user query searching for textual responses corresponding to atopic and responding to a survey question; and selecting the set oftextual responses based on a relevance of each textual response from theset of textual responses to the user query; and determine the sentimentindicator for each extracted sentence from the set of extractedsentences by determining, for each extracted sentence, a sentiment scoreindicating a positive sentiment corresponding to the topic or a negativesentiment corresponding to the topic.
 4. The system of claim 1, furthercomprising instructions that, when executed by the at least oneprocessor, cause the system to: generate the first cluster of extractedsentences by identifying, from among the sorted set of extractedsentences, extracted sentences corresponding to a first sentimentindicator and to a first linguistic context embedding generated by anembedding algorithm; and generate the second cluster of extractedsentences by identifying, from among the sorted set of extractedsentences, extracted sentences corresponding to a second sentimentindicator and to a second linguistic context embedding generated by theembedding algorithm.
 5. The system of claim 1, further comprisinginstructions that, when executed by the at least one processor, causethe system to generate the set of extracted sentences by: extractingselected sentences from the set of textual responses utilizing anextractor-selection network of the response-extraction-neural network;and paraphrasing one or more sentences from the selected sentencesutilizing an abstractor network of the response-extraction-neuralnetwork.
 6. The system of claim 1, further comprising instructions that,when executed by the at least one processor, cause the system to:determine the sentiment indicator for each extracted sentence bydetermining a sentiment score indicating a positive sentiment or anegative sentiment for each extracted sentence; and generate the firstcluster of extracted sentences corresponding to the first context andthe second cluster of extracted sentences corresponding to the secondcontext by: generating the first cluster of extracted sentencescorresponding to positive sentiment scores and to a first linguisticcontext embedding; and generating the second cluster of extractedsentences corresponding to negative sentiment scores and to a secondlinguistic context embedding.
 7. The system of claim 1, furthercomprising instructions that, when executed by the at least oneprocessor, cause the system to select the first extracted sentence andthe second extracted sentence for the response summary by: selecting thefirst extracted sentence from the first cluster of extracted sentencesbased on a relevance of the first extracted sentence to a firstlinguistic context embedding; and selecting the second extractedsentence from the second cluster of extracted sentences based on arelevance of the second extracted sentence to a second linguisticcontext embedding.
 8. The system of claim 1, further comprisinginstructions that, when executed by the at least one processor, causethe system to generate the response summary for the set of textualresponses based on the first extracted sentence and the second extractedsentence by generating the response summary for the set of textualresponses comprising both the first extracted sentence and the secondextracted sentence.
 9. The system of claim 1, further comprisinginstructions that, when executed by the at least one processor, causethe system to generate the response summary for the set of textualresponses based on the first extracted sentence and the second extractedsentence by: selecting the first extracted sentence and a thirdextracted sentence from the first cluster of extracted sentences basedon a relevance of each of the first extracted sentence and the thirdextracted sentence to a first linguistic context embedding; selectingthe second extracted sentence and a fourth extracted sentence from thesecond cluster of extracted sentences based on a relevance of each ofthe second extracted sentence and the fourth extracted sentence to asecond linguistic context embedding; providing the first, second, third,and fourth extracted sentences to a summarization neural network togenerate: a first compressed sentence representing a summary of thefirst extracted sentence and the third extracted sentence: and a secondcompressed sentence representing a summary of the second extractedsentence and the fourth extracted sentence; and generating the responsesummary for the set of textual responses comprising the first compressedsentence and the second compressed sentence.
 10. A non-transitorycomputer readable medium storing instructions thereon that, whenexecuted by at least one processor, cause a computer system to: providea set of textual responses to a response-extraction-neural network togenerate a set of extracted sentences, wherein each extracted sentencecorresponds to a selected sentence from the set of textual responses;determine a sentiment indicator for each extracted sentence from the setof extracted sentences; sort the set of extracted sentences according tothe sentiment indicator for each extracted sentence to generate a sortedset of extracted sentences; based on the sorted set of extractedsentences, generate a first cluster of extracted sentences correspondingto a first context and a second cluster of extracted sentencescorresponding to a second context; and generate a response summary forthe set of textual responses based on a first extracted sentence fromthe first cluster of extracted sentences and a second extracted sentencefrom the second cluster of extracted sentences.
 11. The non-transitorycomputer readable medium of claim 10, further comprising instructionsthat, when executed by the at least one processor, cause the computersystem to: generate the response summary for the set of textualresponses in response to a query for textual responses corresponding toa time period and responding to a survey question; generate anadditional response summary for an additional set of textual responsesin response to an additional query for additional textual responsescorresponding to an additional time period and responding to the surveyquestion; and provide, for display on a client device, a summarycomparison of the response summary for the set of textual responses andthe additional response summary for the additional set of textualresponses.
 12. The non-transitory computer readable medium of claim 10,further comprising instructions that, when executed by the at least oneprocessor, cause the computer system to determine the sentimentindicator for each extracted sentence by determining, for each extractedsentence, a sentiment score indicating a positive sentiment or anegative sentiment.
 13. The non-transitory computer readable medium ofclaim 12, further comprising instructions that, when executed by the atleast one processor, cause the computer system to generate the sortedset of extracted sentences by: selecting, from the set of textualresponses, a first subset of extracted sentences corresponding tosentiment scores satisfying a positive-sentiment-score threshold forpositive sentiments; and selecting, from the set of textual responses, asecond subset of extracted sentences corresponding to sentiment scoressatisfying a negative-sentiment-score threshold for negative sentiments.14. The non-transitory computer readable medium of claim 13, furthercomprising instructions that, when executed by the at least oneprocessor, cause the computer system to generate the first cluster ofextracted sentences corresponding to the first context and the secondcluster of extracted sentences corresponding to the second context by:applying a smooth-inverse-frequency algorithm to the first subset ofextracted sentences to generate a first linguistic context embedding anda third linguistic context embedding; generating the first cluster ofextracted sentences by identifying, from among the first subset ofextracted sentences, extracted sentences corresponding to the firstlinguistic context embedding; applying the smooth-inverse-frequencyalgorithm to the second subset of extracted sentences to generate asecond linguistic context embedding; and generating the second clusterof extracted sentences by identifying, from among the second subset ofextracted sentences, extracted sentences corresponding to the secondlinguistic context embedding.
 15. The non-transitory computer readablemedium of claim 14, further comprising instructions that, when executedby the at least one processor, cause the computer system to select thefirst extracted sentence and the second extracted sentence for theresponse summary by: selecting the first extracted sentence from thefirst cluster of extracted sentences based on a maximum marginalrelevance of the first extracted sentence to the first linguisticcontext embedding; and selecting the second extracted sentence from thesecond cluster of extracted sentences based on a maximum marginalrelevance of the second extracted sentence to the second linguisticcontext embedding.
 16. The non-transitory computer readable medium ofclaim 10, further comprising instructions that, when executed by the atleast one processor, cause the computer system to generate the responsesummary for the set of textual responses based on the first extractedsentence and the second extracted sentence by generating the responsesummary for the set of textual responses comprising a portion of thefirst extracted sentence and a portion of the second extracted sentence.17. A method comprising: providing a set of textual responses to aresponse-extraction-neural network to generate a set of extractedsentences, wherein each extracted sentence corresponds to a selectedsentence from the set of textual responses; determining a sentimentindicator for each extracted sentence from the set of extractedsentences; sorting the set of extracted sentences according to thesentiment indicator for each extracted sentence to generate a sorted setof extracted sentences; based on the sorted set of extracted sentences,generating a first cluster of extracted sentences corresponding to afirst context and a second cluster of extracted sentences correspondingto a second context; and generating a response summary for the set oftextual responses based on a first extracted sentence from the firstcluster of extracted sentences and a second extracted sentence from thesecond cluster of extracted sentences.
 18. The method of claim 17:wherein providing the set of textual responses to theresponse-extraction-neural network comprises: receiving, from a clientdevice, a user query searching for textual responses corresponding to atime period and responding to a survey question; and selecting the setof textual responses based on a time identifier for each textualresponse corresponding to the time period indicated by the user query;and determining the sentiment indicator for each extracted sentence fromthe set of extracted sentences comprises determining, for each extractedsentence, a sentiment score indicating a positive sentiment or anegative sentiment.
 19. The method of claim 17, wherein providing theset of textual responses to the response-extraction-neural networkcomprises: receiving, from a client device, a user query searching fortextual responses corresponding to a topic and responding to a surveyquestion; and selecting the set of textual responses based on arelevance of each textual response from the set of textual responses tothe user query; and determining the sentiment indicator for eachextracted sentence from the set of extracted sentences comprisesdetermining, for each extracted sentence, a sentiment score indicating apositive sentiment corresponding to the topic or a negative sentimentcorresponding to the topic.
 20. The method of claim 19, furthercomprising providing the response summary for the set of textualresponses to the client device for display within a graphical userinterface.